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1. Introduction to Symmetry and Particles 


Symmetry simplifies the description of physical phenomena. It plays a particularly impor- 
tant role in particle physics, for without it there would be no clear understanding of the 
relationships between particles. Historically, there has been an "explosion" in the number 
of particles discovered in high energy experiments since the discovery that atoms are not 
fundamental particles. Collisions in modern accelerators can produce cascades involving 
hundreds of types of different particles: p, n, II, K, A, X... etc. 

The key mathematical framework for symmetry is group theory: symmetry transfor- 
mations form groups under composition. Although the symmetries of a physical system are 
not sufficient to fully describe its behaviour - for that one requires a complete dynamical 
theory - it is possible to use symmetry to find useful constraints. For the physical systems 
which we shall consider, these groups are smooth in the sense that their elements depend 
smoothly on a finite number of parameters (called co-ordinates). These groups are Lie 
groups, whose properties we will investigate in greater detail in the following lectures. We 
will see that the important information needed to describe the properties of Lie groups is 
encoded in “infinitessimal transformations", which are close in some sense to the identity 
transformation. The properties of these transformations, which are elements of the tangent 
space of the Lie group, can be investigated using (relatively) straightforward linear algebra. 
This simplifies the analysis considerably. We will make these rather vague statements more 
precise in the next chapter. 

Examples of symmetries include 


i) Spacetime symmetries: these are described by the Poincaré group. This is only an 
approximate symmetry, because it is broken in the presence of gravity. Gravity is the 
weakest of all the interactions involving particles, and we will not consider it here. 


ii) Internal symmetries of particles. These relate processes involving different types 
of particles. For example, isospin relates u and d quarks. Conservation laws can be 
found for particular types of interaction which constrain the possible outcomes. These 
symmetries are also approximate; isospin is not exact because there is a (small) mass 
difference between m, and mg. Electromagnetic effects also break the symmetry. 


ii) Gauge symmetries. These lead to specific types of dynamical theories describing 
types of particles, and give rise to conserved charges. Gauge symmetries if present, 
appear to be exact. 


1.1 Elementary and Composite Particles 


'The fundamental particles are quarks, leptons and gauge particles. 
The quarks are spin 1/2 fermions, and can be arranged into three families 


Electric Charge (e) 
u (0.8 GeV) c (16 GeV) |t (175 GeV) H 
d (&0.3GeV)|s (0.5 GeV) b (4.5 GeV) — 


|o» 


The quark labels u, d, s, c, t, b stand for up, down, strange, charmed, top and bottom. 
'The quarks carry a fractional electric charge. Each quark has three colour states. Quarks 
are not seen as free particles, so their masses are ill-defined (the masses above are "effective" 
masses, deduced from the masses of composite particles containing quarks). 

The leptons are also spin 1/2 fermions and can be arranged into three families 


Electric Charge (e) 


e (0.5 MeV) L^ (106 MeV) T (1.8 GeV) —1 


Ve (< 10 eV) v, (< 0.16 MeV) V. («18 MeV) 0 


The leptons carry integral electric charge. The muon p and taon 7 are heavy unstable 
versions of the electron e. Each flavour of charged lepton is paired with a neutral particle 
v, called a neutrino. The neutrinos are stable, and have a very small mass (which is taken 
to vanish in the standard model). 

All these particles have antiparticles with the same mass and opposite electric charge 
(conventionally, for many particles, the antiparticles carry a bar above the symbol, e.g. 
the antiparticle of u is u). The antiparticles of the charged leptons are often denoted by 
a change of — to +, so the positron et is the antiparticle of the electron e^ etc. The 
antineutrinos 7 differ from the neutrinos v by a change in helicity (to be defined later...). 


Hadrons are made from bound states of quarks (which are colour neutral singlets). 


i) The baryons are formed from bound states of three quarks qqq; antibaryons are formed 
from bound states of three antiquarks qqq 


For example, the nucleons are given by 


p = uud : 938 Mev 
n = udd : 940 Mev 


ii) Mesons are formed from bound states of a quark and an antiquark qq. 


For example, the pions are given by 


qt=ud  : 140 Mev 
a =dt : 140 Mev 
n° = uu, dd superposition i 135 Mev 


Other particles are made from heavy quarks; such as the strange particles K* = us 
with mass 494 Mev , A = uds with mass 1115 Mev, and Charmonium v = cc with mass 
3.1 Gev. 

The gauge particles mediate forces between the hadrons and leptons. They are bosons, 
with integral spin. 


Mass (GeV) Interaction 
y (photon) 0 Electromagnetic 
wr 80 Weak 
W- 80 Weak 
VAN 91 Weak 
g (gluon) 0 Strong 


The gluons are responsible for interquark forces which bind quarks together in nucleons. 
It is conjectured that a spin 2 gauge boson called the graviton is the mediating particle 
for gravitational forces, though detecting this is extremely difficult, due to the weakness of 
gravitational forces compared to other interactions. 


1.2 Interactions 

There are three types of interaction which are of importance in particle physics: the strong, 
electromagnetic and weak interactions. 

1.2.1 The Strong Interaction 


The strong interaction is the strongest interaction. 


e Responsible for binding of quarks to form hadrons (electromagnetic effects are much 
weaker) 


e Dominant in scattering processes involving just hadrons. For example, pp — pp is 


an elastic process at low energy; whereas pp — ppm^-c is an inelastic process at 


higher energy. 


e hesponsible for binding forces between nucleons p and n, and hence for all nuclear 
structure. 


Properties of the Strong Interaction: 


i) The strong interaction preserves quark flavours, although qq pairs can be produced 
and destroyed provided q, q are the same flavour. 


An example of this is: 


+ u u + 

T J s K 
d S a 

P u u X 
u u 


The X* and K* particles decay, but not via the strong interaction, because of con- 
servation of strange quarks. 


ii) Basic strong forces are “flavour blind". For example, the interquark force between 
qq bound states in the v» = cc (charmonium) and Y = bb (bottomonium) mesons are 
well-approximated by the potential 


Vis ^ + Br (1.1) 


and the differences in energy levels for these mesons is approximately the same. 


The binding energy differences can be attributed to the mass difference of the b and 
c quarks. 


iii) Physics is unchanged if all particles are replaced by antiparticles. 


The dynamical theory governing the strong interactions is Quantum Chromodynamics 
(QCD), which is a gauge theory of quarks and gluons. This is in good agreement with 
experiment, however non-perturbative calculations are difficult. 


1.2.2 Electromagnetic Interactions 


The electromagnetic interactions are weaker than the strong interactions. They occur in 
the interactions between electrically charged particles, such as charged leptons, mediated 
by photons. 

The simplest electromagnetic process consists of the absorption or emission of a photon 
by an electron: 


This process cannot occur for a free electron, as it would violate conservation of 4- 
momentum, rather it involves electrons in atoms, and the 4-momentum of the entire atom 
and photon are conserved. 

Other examples of electromagnetic interactions are electron scattering mediated by 
photon exchange 


VOL X UP UE 
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and there are also smaller contributions to this process from multi-photon exchanges. 
Electron-positron interactions are also mediated by electromagnetic interactions 
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Electron-positron annihilation can also produce particles such as charmonium or bot- 


tomonium 


AZAOAVUPYPL 


The dynamic theory governing electromagnetic interactions is Quantum Electrody- 


namics (QED), which is very well tested experimentally. 


Neutrinos have no electromagnetic or strong interactions. 


1.2.3 The weak interaction 


The weak interaction is considerably weaker than both the strong and electromagnetic 


interactions, they are mediated by the charged and neutral vector bosons W7 


- and Z? 


which are very massive and produce only short range interactions. Weak interactions 


occur between all quarks and leptons, however they are in general negligable when there 


are strong or electromagnetic interactions present. Only in the absence of st 
electromagnetic interactions is the weak interaction noticable. 


rong and 


Unlike the strong and electromagnetic interactions, weak interactions can involve neu- 


trinos. Weak interactions, unlike strong interactions, can also produce flavour change in 


quarks and neutrinos. 


The gauge bosons W~ carry electric charge and they can change the flavour of quarks. 


Examples of W-boson mediated weak interactions are n — p + e^ + ve: 


xl 


aa 


as 
"3 


and u^ — € + De + vy: 
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u > 
FN 
e i E 
Ve 
T 
e 
and y,+n— uw +p 
Vi > > uc 
pt 
W 
d u 
n d d p 
u u 
The flavour changes within one family are dominant; e.g. 
€ € Ve, Ho 0v 
uo d, ces (1.2) 


whereas changes between families, like u — s and u — b are “Cabibbo suppressed". 


The neutral Z?, like the photon, does not change quark flavour; though unlike the 
photon, it couples to neutrinos. An example of a Z? mediated scattering process is Pe~ 
scattering: 


ee 


e » » e 


In any process in which a photon is exchanged, it is possible to have a Z? boson 
exchange. At low energies, the electromagnetic interaction dominates; however at high 
energies and momenta, the electromagnetic and weak interactions become comparable. 
The unified theory of electromagnetic and weak interactions is Weinberg-Salam theory. 


1.2.4 Typical Hadron Lifetimes 


Typical hadron lifetimes (valid for most decays) via the three interactions are summarized 


below: 
| Interaction Lifetime (s) 
Strong 10722? — 10:25 
Electromagnetic | 10716 — 10724 
Weak 10-7 — 10-15 


with the notable exceptional case being weak neutron decay, which has average lifetime 
of 10?s. 


1.3 Conserved Quantum Numbers 


Given a configuration of particles containing particle P, we define N(P) to denote the 
number of P-particles in the configuration. We define various quantum numbers associated 
with leptons and hadrons. 


Definition 1. There are three lepton numbers. The electron, muon and tauon numbers 
are given by 


Le = N(e~) - N(e*) + N(ve) — N(%) 
Ly = N(u)— N(u*) + N (va) — N (pp) 
L. = N(T7) — N(r*) + N(v.) - N(©-) (1.3) 


In electromagnetic interactions, where there are no neutrinos involved, conservation 
of L is equivalent to the statement that leptons and anti-leptons can only be created or 
annihilated in pairs. For weak interactions there are more possibilities, so for example, an 
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election e^ and anti-neutrino v, could be created. Lepton numbers are conserved in all 
interactions. 
There are also various quantum numbers associated with baryons. 


Definition 2. The four quark numbers S, C, B and T corresponding to strangeness, 
charm, bottom and top are defined by 


— N(t)) (1.4) 


These quark quantum numbers, together with N(u) — N(ü) and N(d) — N(d), are 
conserved in strong and electromagnetic interactions, because in these interactions quarks 
and antiquarks are only created or annihilated in pairs. The quark quantum numbers are 
not conserved in weak interactions, because it is possible for quark flavours to change. 


Definition 3. The baryon number B is defined by 


1 i 
B = SUNG) - N@) (5) 
where N(q) and N(q) are the total number of quarks and antiquarks. Baryons therefore 
have B = 1 and antibaryons have B = —1; mesons have B = 0. B is conserved in all 
interactions. 


Note that one can write 


B- F (N(u) N(u) + N(d) - N(d)--C-T—5- B) (1.6) 


Definition 4. The quantum number Q is the total electric charge. Q is conserved in all 
interactions 
In the absence of charged leptons, such as in strong interaction processes, one can write 
1 E » 
3(N(d) - N(d) - S - B) (1.7) 
Hence, for strong interactions, the four quark quantum numbers S, C, B, T together 
with Q and B are sufficient to determine N(u) — N(u) and N(d) — N(d). 


Qi SVG) N(à)4- C 4 T) 
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2. Elementary Theory of Lie Groups and Lie Algebras 


2.1 Differentiable Manifolds 


Definition 5. A n-dimensional real smooth manifold M is a (Hausdorff topological) space 
which is equipped with a set of open sets U® such that 

1) For each p € M, there is some U? with p € U? 

2) For each U?, there is an invertible homeomorphism xq : U* — R” onto an open 
subset of R” such that if U^ NUP Æ () then the map 


£90 Lg | : alU NUP) > zg(U* nU?) (2.1) 


is smooth (infinitely differentiable) as a function on R”. 

The open sets U* together with the maps £a are called charts, the set of all charts 
is called an atlas. The maps £a are local co-ordinates on M defined on the U%, and have 
components z?, for i = 1,...,n. So a smooth manifold looks locally like a portion of R”. 

A n-dimensional complex manifold is defined in an exactly analogous manner to a real 
manifold, with R” replaced by C" throughout. 


Definition 6. Suppose M is a m-dimensional smooth manifold, and N is a n-dimensional 
smooth manifold, with charts (U%, x4), (W4, ya) respectively. Then the Cartesian product 
X = MxN isam-+n-dimensional smooth manifold, equipped with the standard Cartesian 
product topology. 

The charts are V^^^ = U? x W^ with corresponding local co-ordinates 


Za,A = Ta X UA: U? x w^ a pRa (2.2) 


Definition 7. Suppose M is a m-dimensional smooth manifold, and N is a n-dimensional 
smooth manifold, with charts (U®, 2), (W4, ya) respectively. Then a function f : M —^ N 
is smooth if for every U* and W^ such that f(U*) n W^ zx 0, the map 


yao foa,':ta(U%) > yA(W^) (2.3) 
is smooth as a function R” — R”. 


Definition 8. A smooth curve on a manifold M is a map y : (a,b) — M where (a,b) is 
some open interval in R such that if U is a chart with local co-ordinates x then the map 


xoy: (a,b) > R” (2.4) 
may be differentiated arbitrarily often. 


2.2 Lie Groups 


Definition 9. A group G is a set equipped with a map e : Gx G — G, called group 
multiplication, given by (g1, go) > 91°92 € G for gi, go € G. Group multiplication satisfies 
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i) There exists e € G such that g è e = e e g = g for all g € G. e is called an identity 
element. 


ii) For every g € G there exists an inverse g | € G such that geg | = gt eg — e. 


iii) For all g1, 92,93 € G; g1e(goega) = (g1e9g2)egs, so group multiplication is associative. 


It is elementary to see that the identity e is unique, and g has a unique inverse g^ !. 
Definition 10. A Lie group G is a smooth differentiable manifold which is also a group, 
where the group multiplication e has the following properties 


i) The map e: G x G — G given by (g1,g2) — 91 € go is a smooth map. 


l is a smooth map 


ii) The inverse map G — G given by g > g^ 
Henceforth, we shall drop the e for group multiplication and just write g1 e go = 9192. 
Examples: 

Many of the most physically interesting Lie groups are matrix Lie groups in various 
dimensions. These are subgroups of GL(n, R) (or GL(n, C)), the n x n real (or complex) 
invertible matrices. Group multiplication and inversion are standard matrix multiplication 
and inversion. 

Suppose that G is a matrix Lie group of dimension k. Let the local co-ordinates 
be x’ for i = 1,...,k. Then g € G is described by its matrix components g^? (a?) for 
A, B — 1,...,n. The g^? are smooth functions of the co-ordinates z^. Examples of matrix 
Lie groups are (here F = R or F = C): 


i) GL(n,F), the invertible n x n matrices over F. The co-ordinates of GL(n, F) are the 


n? real (or complex) components of the matrices. 


ii) SL(n,F) = (M € GL(n, F) : det M = 1) 

iii) O(n) = (M € GL(n, R): MM? = In} 

iv) U(n) = (M € GL(n, C) : MM! = In}, where f is the hermitian transpose. 
v) SO(n) = (M € GL(n,R) : MM? =I, and det M = 1) 


vi) SU(n) = (M € GL(n,C) : MM! = I, and det M = 1}. SU(2) and SU(3) play a 
particularly important role in the standard model of particle physics. 


vii) SO(1,n — 1) = (M € GL(n, R) : M?nM = ņ and det M = 1} 
where 7 = diag (1, —1, —1,--- — 1) is the n-dimensional Minkowski metric. 


'There are other examples, some of which we will examine in more detail later. It can 
be shown that any closed subgroup H of GL(n,F) (i.e. any subgroup which contains all 
its accumulation points) is a Lie group. 
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Some of these groups are related to each other by group isomorphism; a particularly 


simple example is SO(2) & U(1). Elements of U(1) consist of unit-modulus complex 
numbers et? for 0 € IR under multiplication, whereas SO(2) consists of matrices 


R(0) = Ge pud (2.5) 


sin 0 cos 0 


which satisfy R(0+¢) = R(0)R(¢). The map T : U(1) — SO(2) given by T(e/?) = R(0) 


is a group isomorphism. 


2.3 Compact and Connected Lie Groups 


A lie group G is compact if G is compact as a manifold. Recall that a subset of U C R” 
is compact iff it is closed and bounded, or equivalently iff every sequence un € U has a 
subsequence which converges to some u € U. 

It is straightforward to see that SU (n) is compact, for if we denote the rows of M € 
SU (n) by Ra then R Rp = óAp. Hence the components M“? are all bounded |M^P| < 1. 
So it follows that if Mn is a sequence of points in SU (n), then by repeated application of the 
Bolzano-Weierstrass theorem, there is a subsequence Mp, which converges to some matrix 
N. Moreover as the constraints det Mn, = 1 and Mp, Mj, — ] are smooth functions of the 
matrix components, one must also have det N = 1 and NN? = 1 in the limit as r — oo, 
ie. N € SU (n). 

[There is a subtlety concerning convergence which we have glossed over, namely how 
one actually defines convergence. We assume the existence of some matrix norm (for 
example ||M||sup = max(|M^P])) with respect to which convergence is defined. As all 
(finite-dimensional) matrix norms are equivalent, convergence with respect to one matrix 
norm ensures convergence with respect to any norm]. 

In contrast, the Lorentz group SO(1, n — 1) is not compact. For example, consider for 
simplicity SO(1, 1). One can define a sequence of elements Mn € SO(1,1) by 


d es godin sinh n (2.6) 
sinhn coshn 


As the components of Mp are unbounded, it follows that Mp cannot have a convergent 
subsequence. Observe that as SO(1, n—1) is a Lie subgroup of both S L(n, R) and GL(n, R) 
it must follow that SZ(n,R) and GL(n,IR) are also non-compact. 

A Lie group G is said to be connected if any two points in the group can be linked 
together by a continuous curve in G. 

O(n) is not connected. To see this, observe that if M € O(n) then MM? = 1 and on 
taking the determinant this implies det M = +1. Now take M € O(n) with det M = —1, 
so if O(n) is connected, there is a continuous curve y : [0,1] — O(n) with 4(0) = I and 


y(1) = M. We can then compute det y(t) which must be a continuous real function of t 
such that det y(t) € (—1,1) for all t € [0,1] and det 4(0) = 1, det4(1) = —1. This is not 
possible. 
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We shall say that two points in G are connected if they can be linked with a continuous 
curve. This defines an equivalence relation on G, and hence partitions G into equivalence 
classes of connected points; the equivalence class of g € G is called the connected component 
of g. The equivalence class of points of O(n) connected to I is SO(n), which is connected. 


2.4 Tangent Vectors 


Suppose that U is an open subset of a manifold M, and that the curve y passes through 
some p € U with 4(to) = p. Then the curve defines a tangent vector at p, denoted by 4p, 
which maps smooth real functions f : U — R to R according to 


tp f [SG on, (2.7) 


The components of the tangent vector are 


AP = [SG o Winey = 4010) (2.8) 


Note that one can write (using the chain rule) 


Msasa 


~ 
ll 
un 


(oz er (2.9) 


Il 
vi 
Ze 


Il 
= 


Proposition 1. The set of all tangent vectors at p forms a n-dimensional vector space 
(where n = dim M), denoted by T,(M). 
Proof 

Suppose that p lies in the chart U with local co-ordinates r. Suppose also that V, 
W € T,(M) are tangent vectors at p corresponding to the curves y, c, where without 
loss of generality we can take y : (a,b) —^ M, co : (a,b) —^ M with a < to < b and 
y(to) = o(to) = p 

Take a,b € R. Consider the curve in IR" defined by 


p(t) = a(z o y)(t) + b(z oco)(t) — (a+ b — 1)a(p) (2.10) 
where scalar multiplication and vector addition are the standard operations in R”. 
Note that p(tg) = x(p). 
Then define the curve p on U by p = z^ o p, so that p(to) = p. 
If f is a smooth function on U then by (2.9) it follows that 


o = d i 
a: T Dee C @ o 9) Ji=to 


py (f) = y 
i=1 


—17-— 


A NS (241) 


So it follows that a^ + bo; is the tangent vector to p at p. 
In order to compute the dimension of the vector space it suffices to compute a basis. 
To do this, define n curves p(i) for i = 1,...,n passing through p by 


( o pi) (t) = (a(p))? + tò; (2.12) 
Using (2.9) it is straightforward to compute the tangent vectors to the curves p(i) at 
P; 


ADA = ÈU o 27 ag (213) 


and hence, if 7 is a curve passing through p then (2.9) implies that 


Af) = M BG (2.14) 
gasi 


and hence it follows that 5, = $5; 4 329(i)y. Hence the tangent vectors to the curves 
p(i) at p span T;(M). M 

Given the expression (2.13), it is conventional to write the tangent vectors to the curves 
p(i) at p as 

jy = (223), (2.15) 
Lemma 1. Suppose that Mı, Mə are smooth manifolds of dimension nı, ng respectively. 
Let M = Mı x Mə be the Cartesian product manifold and suppose p = (py, p2) € M. Then 
Ty (M) = Ty, (Mi) Ð Tp. (Mo). 
Proof 

Suppose V, € T (M). Then V is the tangent vector to a smooth curve y(t), with 
*(to) = p. Write y(t) = (y1 (t), y2(t)); i(t) is then a smooth curve in M; and 4;(to) = pi 
for i — 1,2. 

Let f be a smooth function f : M — R. Suppose that x^ are local co-ordinates on Mi 
for a = 1,...,n; and y” are local co-ordinates on M» for m = 1,...,n» corresponding to 
charts U; C Mı and U5 C Mo. 

Then one has n4 + n2 local co-ordinates z% on M where if q = (q1, q2) € U1 x Us, 


z(q1,q2) = (x! (a1)... a" (31), y! (2), "? (q2)) (2.16) 
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Note that fi(qi) = f(a, q2) is a smooth function of qı when qə is fixed, and fo(q2) = 
f(q1, q2) is a smooth function of q2 when qı is fixed. 
Then using the chain rule 


nitn2 
Vit = yo 5 af oz boy (o0) (t))li-to 

a=1 
E ^ d a 

-— gra 1 0 eeue) gu o m" (leno 
a=1 
c. d ; 

+ 3 ay V? o lepya = t o 72)" (t))| ito 
m 

= (V(1)p + V(2)p)f (2.17) 


where V(1), is the tangent vector to 7 at p, and V(2), is the tangent vector to 72 at 
p. Hence V; = V(1)p + V(2)p. Conversely, given two smooth curves *1(t), y2(t) in Mi, M» 
passing through p; and p» at t = to, with associated tangent vectors V (1), and V(2),, one 
can construct the smooth curve y(t) = (^1(t),ya(t)) in M passing through p = (pi, p2) at 
t = tg. Then (2.17) shows that V(1), + V(2), can be written as V, € T (M). 


2.5 Vector Fields and Commutators 
The tangent space of M, T(M) consists of the union 


T(M) = LJ T; (v) (2.18) 
pEM 


A vector field V on M is a map V : M — T(M) such that V (p) = Vp € T,(M). 
Note that T(M) is a vector space with addition and scalar multiplication defined by 


(X E Y)() - X() +Y) (2.19) 
where X, Y € T(M) and f : M — R is smooth, and 


(aX)(f) = aX(f) (2.20) 


for constant a € R. 
At a point p € M, one can decompose V, into its components with respect to a 
particular chart as 


v= VG, 221) 
It is conventional to write 
TE) 
V=V' - 2.22 
(5) (2.22) 


where V* = (V o z-1)(a?) are functions R^ — R and ( 2) is a locally defined vector 
field which satisfies 
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i d = ô 


( Ox" 
It follows that T'(M) is n-dimensional with a local basis given by the ( 


(2.23) 


i 
2). The vector 
field is called smooth if the functions V* are smooth functions on R”. 


Suppose now that f is a smooth function on M and that V, W are vector fields on M. 
'Then note that V f can be regarded as a function M — RR defined by 


(V f)(n) = Vor (2.24) 
Hence one can act on V f with W, at some p € M to find 


Wf) = Wiz) (Vf) 
Ju as D)o 
= wm REN 27!) 


Ox' Ox soe TO Dle (2.25) 


The fact that there are second order derivatives acting on f means that we cannot 
write W,(V f) = Zpf for some vector field Z. 
However, these second order derivatives can be removed by taking the difference 


+ wivi (5 


,0VÀ Ow! Ó 
W,(V f) — V(W f) = (W; P lew) — Va lz o) o9 lew) (2.26) 


which can be written as Zpf where Z is a vector field called the commutator or alter- 
natively the Lie bracket of W and V which we denote by [W, V] with components 


; VÍ , j 
wew V pa (2.27) 
Ox" Ox" 
Exercise: 
Prove that the Lie bracket satisfies 
i) Skew-symmetry: [X,Y] = —[Y, X] for all smooth vector fields X, Y € T(M). 


ii) Linearity: [aX + GY, Z] = o|X, Z] + 8|Y, Z] for a, B constants and X, Y, Z € T(M). 
ii) The Jacobi identity: [[X, Y], Z]+[[Z, X], Y]+[[Y, Z], X] = 0 forall X, Y, Ze T(M). 


Definition 11. Let V be a smooth vector field on M. An integral curve o(t) of V is a 
curve whose tangent vector at a(t) is V |;(y, i.e. 


Ll) = Và (228) 


where in a slight abuse of notation, o'(t) = (x o a)'(t) for some local co-ordinates x. 
Such a curve is guaranteed to exist and to be unique (at least locally, given an initial 
condition), by the standard existence and uniqueness theorems for ODE’s. 
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2.6 Push-Forwards of Vector Fields 


Suppose that M, N are two smooth manifolds and f : M — N is a smooth map. Then 
there is an induced map 


fe: T(M) > T(N) (2.29) 


which maps the tangent vector of a curve y passing through a point p € M to the 
tangent vector of the curve f o y passing through f(p) € N. 

In particular, for each smooth function h on N, and if y is a curve passing through 
p € M with 4(0) = p, and if V; € T,(M) is the tangent vector of y at p then f. V, € Typ) (N) 
is given by 


(f V5)h = S (ho (f). 
-5 


(ho f) (2.30) 


Hence it is clear that the push-forward map f. is linear on the space of tangent vectors. 
Note that if M, N and Q are manifolds, and f : M —^ N, g: N — Q are smooth 
functions then if h : Q — R is smooth and p € M, 


((go f), V5) (h) = Vp(ho (go f) 
= V,((hog)o f) 


= (&V5)(ho g) 
= (g«(f«Vo)) (A) (2.31) 
and hence 
(go f)« = 9«o f. (2.32) 


2.7 Left-Invariant Vector Fields 


Suppose that G is a Lie group and a, g € G. Define the operation of left-translation 
La: G —> G by 


Lag = ag (2.33) 


La defined in this fashion is a differentiable invertible map from G onto G. Hence, one 
can construct the push-forward Las of vector fields on G with respect to Dg. 


Definition 12. A vector field X € T(G) is said to be left-invariant if 


Lal Xlo) = X lag (2.34) 


Given v € T.(G) one can construct a unique left-invariant vector field X(v) € T(G) 
with the property that X(v). = v using the push-forward by 
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X(v)lg = Lg.v (2.35) 
To see that X (v) is left-invariant, note that 


X (v)lag = L(agy«U (2.36) 
but from (2.32) it follows that as Lag = La o Lg we must have 


Lagt = (La 9 Lg)«v = Las(Lg«v) = La«X(v)g (2.37) 


so X(v) is left-invariant. Hence there is a 1-1 correspondence between elements of the 
tangent space at e and the set of left-invariant vector fields. 


Proposition 2. The set of left-invariant vector fields is closed under the Lie bracket, i.e. 
if X, Y € T(G) are left-invariant then so is [X,Y]. 
Proof 

Suppose that f : G — IR is a smooth function. Then 


(Lge OY alt = [X, Y]g(f o La) 
= X4(Y(f o La)) — Y(X (F o La)) (2.38) 


But as X is left-invariant, LaxXg = Xag so 


Xagf = (LaxXg)f = Xg(f o La) (2.39) 


so replacing f with Y f in the above we find 


XQ4((Y f) o La) = Xag(Y f) (2.40) 


Moreover, as Y is left-invariant, it is straightforward to show that 


(Y(f o L4))g = Y«f 9 La) 
= (Las VY) f 
= Yag(f) 
= (Y f)(ag) 
= ((Y f) o Lag (2.41) 


so Y (f o La) = (Y f) o La 
Hence 


(Y f) o La) — Yo (CX f) o La) 
= XaglY f) = Yag(X f) 
= |X, Y leas (2.42) 


So L4,[X, Y]; = [X, Y ]ag, hence [X,Y] is left-invariant. M 
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2.8 Lie Algebras 


Definition 13. Suppose that G is a Lie group. Then the Lie algebra L(G) associated 
with G is T.(G), the tangent space of G at the origin, together with a Lie bracket | , | : 
L(G) x L(G) — L(G) which is defined by 


[v, w] = [L.v, D.w]e (2.43) 


for v,w € Te(G), L.v and Lw denote the smooth vector fields on G obtained by pushing 
forward v and w by left-multiplication (i.e. L,v|; = Lg.v), and |L.v, Lyw] is the standard 
vector field commutator. As the Lie bracket on L(G) is obtained from the commutator of 
vector fields, it follows that the Lie bracket is 


i) Skew-symmetric: |v, w] = —[w, v] for all v,w € L(G). 
ii) Linear: [ovi + Bv, w| = ovi, w] + Bluz, w] for o, B. constants and v1, v2, w € L(G), 


iii) and satisfies the Jacobi identity: [[v, w], z] + [[z, v], w] + [[w, z], v] = 0 for all v, w, 
z € L(G). 


where (ii) follows because the push forward map is linear on the space of vector fields, 
and (iit) follows because as a consequence of Proposition 2, Lj |v,w| = [L.v, L.w],. 
More generically, one can also define a Lie algebra to be a vector space g equipped 


with a map |, ] : g x g > g satisfying (i), (ii), (iii) above. 


Definition 14. Suppose that (T; : i =1,...,n} is a basis for L(G). Then the T; are called 


generators of the Lie algebra. As [T;,T;] € L(G) it follows that there are constants cij" 
such that 


ID e Th (2.44) 


k 


The constants cij” are called the structure constants of the Lie algebra. 


'The structure constants are constrained by the antisymmetry of the Lie bracket to be 
antisymmetric in the first two indices; 


cij" = —cj;" (2.45) 


Also, the Jacobi identity implies 


[3,75], Te] + [[T;; T]; Ze] + [ET Ti], 15] = 0 (2.46) 


which gives an additional constraint on the structure constants 


cij Cok” + ek eg" + Chi eo; = 0 (2.47) 
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2.9 Matrix Lie Algebras 


'The Lie algebras of matrix Lie groups are of particular interest. Suppose that G is a matrix 
Lie group, and V € T(G) is a smooth vector field. Let f be a smooth function of the matrix 
components g^P. Then if h € G, 


An Of (2.48) 


where 


(2.49) 


defines a tangent matrix associated with the components V7" of V at h. Each vector 
field has a corresponding tangent matrix, and it will often be most convenient to deal with 
these matrices instead of more abstract vector fields as differential operators. 

In particular, if y(t) is some curve in G with tangent vector V then 


d 
Vf= aU ot) 
= dg^P of 
= “dE BgAB (2.50) 


AB 
hence the tangent vector to the curve corresponds to the matrix eee We will fre- 


quently denote the identity element of a matrix Lie group by e = 1 
Examples of matrix Lie algebras are 


e a) GL(n,R): the co-ordinates of GL(n,R) are the n? components of the matrices, so 
GL(n,R) is n?-dimensional. There is no restriction on tangent matrices to curves in 
GL(n,R), the space of tangent vectors is Mj, (IR), the set of n x n real matrices. 


e b) GL(n, C): the co-ordinates of GL(n, C) are the n? components of the matrices, so 
GL(n, C) is 2n?-dimensional when viewed as a real manifold. There is no restriction 
on tangent matrices to curves in GL(n, C), the space of tangent vectors is M, (C), 
the set of n x n complex matrices. 


c) SL(n, R): Suppose that M(t) is a curve in SL(n, R) with M(0) = I. To compute 
the restrictions on the tangent vectors to the curve note that 


det M(t) 21 (2.51) 


so, on differentiating with respect to t, 
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dM(t), _ 
dT js 


Tr(M-(t) (2.52) 


dM(t) 
dt 
Tr m = 0. The tangent vectors correspond to traceless matrices. Hence SL(n,R) is 


and so if we denote the tangent vector at the identity to be m — lio then 


n? — 1 dimensional. 


e d) O(n): suppose that M(t) is a curve in O(n) with M(0) = I. To compute the 
restrictions on the tangent vectors to the curve note that 


M(t)M(t)? 21 (2.53) 


so, on differentiating with respect to t, 


dM(t) E dM(t)* 
— — M(t M (t —0 2.54 
"AM (0? + M(0) (2.54) 
and hence if m = ue [i-o then m+m? = 0. The tangents to the curve at the identity 


correspond to antisymmetric matrices. There are $n(n — 1) linearly independent 
antisymmetric matrices, hence O(n) is 3n(n — 1)-dimensional. 


Note that the Lie algebra of SO(2) is 1-dimensional and is spanned by 


n-( 1 o (255 


As [T1, T1] = 0 it follows trivially that the Lie bracket vanishes 


e f) SO(n) the group of n x n real matrices such that if M € SO(n) then MMT — 1 
and det M — 1. By the reasoning in (c) and (e) it follows that the tangent matrices 
at the identity are skew-symmetric matrices (these are automatically traceless). 


As the skew symmetric matrices are automatically traceless, it follows that the Lie 
algebra £L(SO(n)) of SO(n) is identical to the Lie algebra of O(n). If v,w € £(SO(n)) 
are skew-symmetric matrices it is straightforward to show that the matrix commu- 
tator [v, w] is also skew symmetric, as [v,w]T = (vw — wv)? = wl? — vTwT = 
[w,v] = —[v,w]. Hence [v,w] € £(SO(n)) as expected. We will show that vector 
field commutation can be reduced to tangent matrix commutation for matrix Lie 


groups. 
Exercise 


Show that the tangent vectors of U(n) at I consist of antihermitian matrices, and the 
tangent vectors of SU(n) at I are traceless antihermitian matrices. 
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Proposition 3. Suppose that G is a matrix Lie group and V is a smooth vector field on 
G and a € G is fired. If V denotes the tangent matrix associated with V, then the tangent 
matrix associated with the push-forward LasV is aV. 
Proof 

Suppose h € G, and f : G — R is a smooth function on G. Consider the tangent 
vector L4,Vj, defined at ah 

'Then 


(LaxVn) f = Vil f o La) 


= Vif (2.56) 


where f(g) = f(ag). 
So 


= (a8 DT |, (2.57) 


So it follows that the tangent matrix associated with LV; is aV. M 
Using this result, it is possible to re-interpret the commutator of two left-invariant 
vector fields in terms of the matrix commutators of their associated matrices. 


Proposition 4. Suppose that G is a matrix Lie group and that v,w € Te(G) and V, W 
are the left-invariant vector fields defined by Vg = Lgxv, Wg = Lg«w. Then the matrix 
associated with |V, W]e is the matrix commutator of [6,4] where ô and iv are the matrices 
associated with v and w. 
Proof 

By definition, the matrix associated with |V, W] is 


AB 
v. w]^? = v, wy 29 
B pƏW™ 9g^P av™ 9g^P 
7 oe aoe ox? C li 3,48 
= P m g p m g 
d aap gy ) "d às; ga ) 
oW OV 
— yp p 
V B W uP (2.58) 


where V and W denote the matrices associated with V and W. But from the previous 
proposition V, = g^C CP and W, = gAC QCP so 
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o ð 
IV, w]48 = v? "m leB -WP Dm leô B 
E jAC SUB ~AC «CB ái 
= [0,47] 4P (2.59) 


as required. M 

We have therefore shown that if G is a matrix Lie group then the elements £(G) can be 
associated with matrices and the Lie bracket is then simply standard matrix commutation 
by Proposition 4 (which can be directly checked satisfies all three of the Lie bracket for 
Lie algebras). In the literature, it is often conventional to denote the Lie algebra of SO(n) 
by so(n), su(n) is the Lie algebra of SU(n), u(n) the Lie algebra for U(n) etc. We will 
however continue to use the notation L(G) for the Lie algebra of Lie group G. 

Observe that the image [C(G), £(G)] under the Lie bracket need not be the whole of 
L(G). This is clear for SO(2), as the Lie bracket vanishes identically in that case. Recall 
that the Lie bracket on IR viewed as a Lie group under addition vanishes identically as 
well. If G is a connected 1-dimensional Lie group then G must either be isomorphic to R 
or SO(2) (equivalently U(1)). 


2.10 One Parameter Subgroups 


Definition 15. A curve a : IR — G is called a one-parameter subgroup if o(s)o(t) = o(s+t) 
for all s,t € R. 

Note that if c(t) is a 1-parameter subgroup then o(0) = e. 

We shall show that these subgroups arise naturally as integral curves of left-invariant 
vector fields. 


Proposition 5. Suppose that V is a left-invariant vector field. Let c(t) be the integral 
curve of V which passes through e when t = 0. 

Then o(t) is a 1-parameter subgroup of G. 
Proof 

Let x denote some local co-ordinates. 

Consider the curves x1(t) = o(s)o(t) and y2(t) = ao(s + t) for fixed s. 

These satisfy the same initial conditions x1(0) = x2(0) = a(s). 

By definition, x2 satisfies the ODE 


qi (n oxalt)") = Gp n e os +4)" 
= Vo(s-+4)(2") 


= Vot) Ge”) (2.60) 


Consider 
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= E o Lo(s) o £7!) o (xo a)(t))”) 
d 


= E ((a o Lo(s) o aT) loot) Ge t o a(t))™) 


where we have used the chain rule. But by definition of c(t), 


S (o e(0)") = V 


Hence, substituting this into the above: 


So x1, X2 satisfy the same ODE with the same initial conditions. 


(2.61) 


(2.62) 


(2.63) 


Hence it follows that o(s)o(t) = o(s +t), i.e. o defines a 1-parameter subgroup. M 
The converse is also true: a 1-parameter subgroup c(t) has left-invariant tangent 


vectors 


Proposition 6. Suppose c(t) is a 1-parameter subgroup of G with tangent vector V. Sup- 
pose Ve =v. Then Vo) = Lov, i.e. the tangent vectors are obtained by pushing forward 


the tangent vector at the identity. 
Proof 
Suppose f : Œ — R is a smooth function. Then 


Vent = S ((oo)0)) 
di (o9 = frt), 
$9 fle(Be(h)) — fet) 
a - fe) 
w o Lei) o o(t^))|e-o 
x (Lo(t)xv) f 


SO Vott) m Ls (t«v. a 
From this we obtain the corollory 


(2.64) 


Corollory 1. Suppose that c(t), u(t) are two 1-parameter subgroups of G with tangent 


vectors V, W respectively, with Ve = We = u. Then o(t) = u(t) for all t. 
Proof 
Note that 
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= (Loo. u)o" (2.65) 


and also 


S Gro n(0)") = Woga” 
= (Lygg«u)z^" (2.66) 


So coo and z o u satisfy the same ODE and with the same initial conditions, hence 


2.11 Exponentiation 


Definition 16. Suppose v € T.(G), Then we define the exponential map exp : Te(G) —^ G 
by 


exp(v) = 0y(1) (2.67) 


where o,(t) denotes the 1-parameter subgroup generated by X(v), and X(v) is the 
left-invariant vector field obtained via the push-forward X(v)g = Lv 
Note that exp(0) — e. 


Proposition 7. [fv € T.(G) and t € R then 


exp(tv) = o.(t) (2.68) 


and hence exp((t1 + t2)v) = exp(tiv)exp(tev). 
Proof 

Take a € R, a £0. Note that o,(at) and og,(t) are both 1-parameter subgroups of G. 
The tangent vector to dq,(t) at the origin is av. 


We also compute the tangent vector to a, (at) at e via 


d T, d n n 
di ((x o o(at)) Jess = "Tan e o a(at)) Jon = av (2.69) 
So o,(at) and as, (t) have the same tangent vector av at the origin. Therefore o,(at) = 
Galt): 
Hence 


exp(tv) = oty(1) = a(t) (2.70) 


as required. M 
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2.12 Exponentiation on matrix Lie groups 


Suppose that G is a matrix Lie group, and v € T.(G) is some tangent matrix. The 
exponential exp(tv) produces a curve in G with 4 (exp(tv))|t-o = v satisfying exp((t4 + 
t2)v) = exp(t1v) exp(tav) 

It is then straightforward to show that 


d exp (to) liu, = lim (E exp((to + 0o) — exp(tou))) 
lim (t7 (exp(tv) — I) exp(tov)) 
v exp(tov) (2.71) 


Similarly, one also finds 4 (exp(tv))|i-t = exp(tov)v, so v commutes with exp(tv). 

It is clear that 4 exp(tv) = vexp(tv) implies that exp(tv) is infinitely differentiable (as 
expected as the integral curve is smooth by construction). Then by elementary analysis, 
one can compute the power series expansion for exp(tv) as 


exp(tv) = » Z (2.72) 


with a remainder term which converges to 0 (with respect to the supremum norm on 
matrices, for example). Hence, for matrix Lie groups, the Lie group exponential operator 
corresponds to the usual operation of matrix exponentiation. 

Comment: Suppose that G4 and G2 are Lie groups. Then G = G1 x Gə is a Lie group, 
and by Lemma 1, L(G) = L(G) B £(G3). 

Conversely, suppose Lie groups G, G1, G2 are such that L(G) = £(G41) B £(G3). Then 
by exponentiation, it follows that, at least in a local neighbourhood of e, G has the local 
geometric structure of G4 x Go. However, as it is not in general possible to reconstruct the 
whole group in this fashion, one cannot say that G = G x G2 globally (typically there will 
be some periodic identification somewhere in the Cartesian product group). 


In general, one cannot reconstruct the entire Lie group by exponentiating elements of 
the Lie algebra. Consider for example, SO(2) and O(2). Both £(O(2)) and £(SO(2)) are 


generated by 
0 1 
quo 2.73 
E Jj (2.73) 


however it is straightforward to show that 
Pn eos 0 sind (2.74) 
— sin 0 cos0 


which always has determinant +1. So SO(2) = exp(£(SO(2))) but O(2) Z exp(£(O(2))). 
However, there do exist neighbourhoods Bo of 0 € L(G) and Bı of I € G such that the 
map exp : By — B; is invertible. (The inverse is called log by convention). 
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2.13 Integration on Lie Groups 


Suppose that G is a matrix Lie group, and let V be a left-invariant vector field on G, and 
suppose that the associated tangent matrix to V at the identity is ô. 
'Then if z are some local co-ordinates on G, we know that 


» ym 99(z) 
g(z)8 = Vot, c (2.75) 
From this formula, it is clear that if h € G is a constant matrix then 
Voce) = Vigla) (2.76) 


If H = {hy,...,h,} is a finite group, and f : H — R is a function, then the integral of 
f over H is simply 


25 f(h;) (2.77) 


and note that if h € H is fixed then 


S f) = M fn) (2.78) 
i=1 


i=1 


We wish to construct an analogous integral over a matrix Lie group G. Suppose that 
x,y are co-ordinates on G and define 


dx = dal... da^, dy = dy! ... dy” (2.79) 


Note that d”x and d"y are related by 


dg = J idy (2.80) 
where J is the Jacobian J = det ( ov). 
Now suppose that u; fori = 1,...,n is a basis of left-invariant vector fields. Then 
o 
Lil g(x) a Hi o(a) Dal (2.81) 


Then we have 


Hi g(x) Op) Mi g(x) ET y = Hi g(y) 8yi (2.82) 
SO 
ys 
j EN: 
Fig(y) — Figle) grk (2.83) 
and hence 
det (Hi oxy) = J`! det (M (2.84) 


Motivated by this, we make the 


ES 


Definition 17. The Haar measure is defined by 


n j c1 
d z(det(u ,(,)) (2.85) 
Then by the previous reasoning, 
n j -1 n j -1 
d z (det(u ,(,,)) =d y(det( ,))) (2.86) 
so the measure is invariant under changes of co-ordinates. l l 
Also, if h is a constant matrix, then as the u; are left-invariant, ni ae ni hg(z)' and 
SO 
n j -1 n j =l 
d x (det(u ,,,)) =d x ( det(u; ng(a))) (2.87) 


It follows that if f : G — R, then 


f ralat ce) fet) = f Paletti Ae) 288) 


It can be shown that the Haar measure (up to multiplication by a non-zero constant) 
is the unique measure with this property. 


Example: SL(2,R) 
Consider g € SL(2, R), 
a b 
= 2.89 
g | p Jj (2.89) 
for a,b,c,d € R constrained by ad — bc = 1. Note that 
gi= ( d a] (2.90) 
—c a 


We take co-ordinates x! = b, £? = c, x d (in some neighbourhood of the identity). 
Then 


wii e «(iu — e» 


to be a basis for C(SL(2, R)). Then note that 
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c NS 
0 _, Og 
U3 = ag D | cg b (2.93) 


It follows that the left-invariant vector fields obtained from pushing-forward the vector 
fields associated with v1, vo, v3 at the identity with L, are 


E Ó ^ Ó d o 
p dal ' Or? Ox? 
qe 
H2 op : 
H3 teg + CET (2.94) 
So the matrix ul is 
l —b c -d 
GS 0 d 0 (2.95) 
a 0 


As det (jid ) = d it follows that the Haar measure in these co-ordinates is 4 db dc dd. 


2.14 Representations of Lie Groups 


Definition 18. Let V be a finite dimensional vector space (over R or C) and let GL(V) 
denote the space of invertible linear transformations V — V. Then a representation of a 
Lie group G acting on V is a map D : G — GL(V) such that 


D(g1g2) = D(q1)D(g2) (2.96) 


for all go, go € G. (i.e. D is a homomorphism). The dimension of the representation is 
given by dim D — dim V. 


Lemma 2. If D is a representation of G then D(e) = 1 where 1 € GL(V) is the identity 
transformation, and if g € G then D(g-!) = (D(g))-!. 
Proof 

Note that D(e) = D(ee) = D(e)D(e) and so it follows that D(e) = 1 where 1 € GL(V) 
is the identity transformation. 

If g € G then 1 = D(e) = D(gg !) = D(g)D(g !), so D(g ') = (D(g))*. W 

If M(V) denotes the set of all linear transformations V — V, and D : G — M(V) 
satisfies D(e) = 1 together with the condition (2.96) then it follows from the reasoning 
used in the Lemma above that D(g) is invertible for all g € G, with inverse D(g-!), and 
hence D is a representation. 

We next define some useful representations 


Definition 19. The trivial representation is defined by D(g) = 1 for allg € G 
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Definition 20. If G is a matriz Lie group which is a subgroup of GL(n,R) or GL(n, C) 
then the group elements themselves act directly on n-component vectors. The fundamental 
representation is then defined by D(g) = g. 


Definition 21. If G is a matrix Lie group then the adjoint representation Ad : G — 
GL(L(G)) is defined by 


(Ad(g))X = gXg ! (2.97) 
for g € G and X € L(G) is a tangent matrix. 


Lemma 3. Ad(g) as defined above is a representation 
Proof 

We first verify that if X € L(g) then Ad(g) X € L(G). 

Fix g € G. Next, recall that if X € L(G) then there is some smooth curve in G, y(t) 
such that X — era Define a new smooth curve in G by p(t) = g»(t)g-!, then the 
tangent matrix to p(t) at t = 0 is given by vory = g 2 og —yxgt. 

Hence Ad(g)X € L(G). 

It is clear that Ad(g) is a linear transformation on X € L(G). 

Note that Ad(e)X = eXe^! = X, so Ad(e) = 1. Also, if g1, g» € G then 


Ad(gig2)X = (gygo) X(gy92) 
= gg2Xg5 9i. 
= (92X95 ')97 
= gq(Ad(gz) X)gi ! 
= Ad(g1) Ad(g) X (2.98) 


1 


hence Ad(gig2) = Ad(g1)Ad(g2). 
It then follows that Ad(g)Ad(g~!) = Ad(gg~') = Ad(e) = 1 so Ad(g) is invertible. NI 


Definition 22. Suppose D is a representation of G acting on V. A subspace W C V is 
called an invariant subspace if D(g)w € W for all g € G and w € W. 


Definition 23. Suppose D is a representation of G acting on V. Then D is reducible if 
there is an invariant subspace W of V with W 40 and W Æ V. If the only invariant 
subspaces of V are 0 and V then D is called irreducible. 


Definition 24. A representation D is called totally reducible if there exists a direct sum de- 
composition of V into subspaces W;, V = Wi Q W2 QD ... B We where the W; are invariant 
subspaces with respect to D and D restricted to Wi is irreducible. 

In terms of matrices, if D is totally reducible, then there is some basis of V in which 
D has a block diagonal form 


D(g-| 9 D»(g) (2.99) 


GA 


Dj(g) denotes D(g) restricted to Wj. 
Definition 25. A representation D of G acting on V is faithful if D(g) = 1 only if g =e. 


Definition 26. Suppose that D is a representation of G acting on V where V is a vector 
space over C equipped with an inner product. Then D is a unitary representation if D(g) : 
V — V satisfies D(g)D(g)! —1 for all g € G. 


Proposition 8. A finite dimensional unitary representation is totally reducible 
Proof 

If D is irreducible then we are done. Otherwise, suppose that W is an invariant 
subspace. Write V = W @W,. Suppose v € W,. Then if w € W and g € g, 


=0 (2.100) 


as D(g !)w € W because W is an invariant subspace. 

Hence it follows that if v € W, then D(g)v € W, and so W, is also an invariant 
subspace. Repeating this process by considering D restricted to the invariant subspaces W 
and W, one obtains a direct sum decomposition of V into invariant (orthogonal) subspaces 
W; such that D restricted to W; is irreducible. BM 


Proposition 9. Let Vi, V2 be finite dimensional vector spaces. Suppose D is a represen- 
tation of G acting on Vi, and A: Vy — V2 is an invertible linear transformation. Define 
D(g) = AD(g)A-!. Then D is a representation of G on Vo. 
Proof 

As D is a composition of invertible linear transformations, D is also an invertible linear 
transformation on V5. 

Also, if 91,92 € G 


D(gig) = AD(g192)A~ 
= AD(g1)D(ga)A^ : 
— AD(g1)A 1 AD(g3) A7 : 
= D(g)D(gz) (2.101) 


and hence D is also a representation. M 


Definition 27. Suppose D is a representation of G acting on Vi, and A: Vi — V3 is an 
invertible linear transformation. Define D(g) = AD(g)A-!. Then D and D are said to be 
equivalent representations. 
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Proposition 10. Schur’s First Lemma: Suppose that Dı and D» are two irreducible rep- 
resentations of G acting on Vi and V» respectively and there exists a linear transformation 
A : Vi — V such that 

AD, (g) = Da(g)A (2.102) 
for allg € G. Then either Dı and Də are equivalent representations, or A = 0. 
Proof First note that 


Ker A= {weV,: Av =0} (2.103) 


is an invariant subspace of D1, because if Yy € Ker A then 


ADi(g)v = Do(g) Ay = 0 (2.104) 
so Di(g)v € Ker A for all g € G. But D is irreducible on Vi, so one must have 


Ker A=0 or Ker A= Vj, so Ais 1-1 or A = 0. 
Similarly, 


Im A= (6 € Vo: ¢= Av for some v € Vi} (2.105) 


is an invariant subspace of D», because if o € Im A then there is some v € Vj such 
that $ = Ay and hence 


D5(g)ó = Do(g) Av = ADi(g)u (2.106) 
and hence D2(g)¢ € Im A for all g € G. But D» is irreducible on V5, so one must have 
Im A=0 or Im A= Və, i.e. A = 0 or A is onto. 
Hence either A = 0 or A is both 1-1 and onto ie. A is invertible. If A is invertible 
then Dı and D» are equivalent. M 


Proposition 11. Schur’s Second Lemma: Suppose that D is an irreducible representation 
of G on V, where V is a vector space over C, and A: V > V is a linear transformation 
such that 


AD(g) = D(g)A (2.107) 


for all g € G. Then A = Al for some A € C. 
Proof 
As V is over C, A has at least one eigenvalue. Let A € C be an eigenvalue of A, with 
corresponding eigenspace U (U # 0). Then U is an invariant subspace of V with respect 
to D, for if € U then 
Ay = à (2.108) 
and if g € G, then 
AD(g)b = D(g) AY = D(g) (AY) = AD(g)u (2.109) 


so D(g)v € U. 
But D is irreducible on V, so this implies U = V (as U Z 0). 
Hence it follows that A — A1. M 
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Definition 28. Suppose that Dı and D» are representations of the Lie group G over vector 
spaces V1 and V» respectively. Let V = Vi G9 V3 be the standard tensor product vector space 
of Vi and Vz consisting of elements vı & v9 (vı € Vi and v2 € V2) in the vector space dual 
to the space of bilinear forms on Vi x Vo. If v4 & v9 € V then vı Q v9 acts linearly on 
bilinear forms Q via v VN = Q(vi, v2). V is equipped with pointwise addition and scalar 
multiplication which satisfy (vı + w1) Q (v9 + w2) = v1 & v2 + v Q wo + W1 Q ve + W1 Q we 
and a(v1 ® v2) = (avı) & v? = v1 ® (ave). 
Then the tensor product representation D is defined as a linear map on V satisfying 


D(g)vi & ve = Di(g)v1 & Do(g)v2 (2.110) 


for g € G and vı € Vy and v9 € Və 


Proposition 12. The tensor product representation defined above is a representation. 
Proof 
The map D(g) is linear by construction, also 


D(e)u & v? = Di(e)v1 & Do(e)vo = v1 ® v2 (2.111) 
because D4(e) = 1 and Do(e) = 1. Hence D(e) = 1. And if g1, g € G then 


D(gig2)v1 Q ve = Di(gig2)vi & Do(g192)vo 
= Di(gi)Di(go)ui & Do(g1)Do(g2)vo 
= D(g)(Di(go)ui & D2(g2)v2) 
= D(gi)D(ga)ui & v2 (2.112) 


so D(g1g2) = D(g1)D(go). Hence, this together with D(e) = 1 implies that D(g) is 
invertible. So D(g) is a representation. MI 

Note that if Dı is irreducible on Vi and D» is irreducible on V? then D = D (9 D» is 
not generally irreducible on V = Vi 69 V2. Indeed, we shall be particularly interested in 
decomposing D into irreducible components in several explicit examples. 


2.15 Representations of Lie Algebras 
Definition 29. Let V be a finite dimensional vector space (over IR or C) and let M(V) 
denote the space of linear transformations V — V. Suppose that L(G) is the Lie algebra of a 
Lie group G. Then a representation of L(G) acting on V is a linear map d : L(G) —^ M(V) 
satisfying 
d([X,Y]) = d(X)d(Y) — d(Y)d(X) (2.113) 
for all X,Y € L(G). The dimension of the representation is the dimension of V. 


Definition 30. The trivial representation of L(G) on V is given by d(X) = 0 for all 
X € L(G) 


= 37-— 


Definition 31. If G is a matrix Lie group and hence L(G) is a matrix Lie algebra, then 
the tangent vectors can be regarded as matrices acting directly om n-component vectors. 
Then we define the fundamental representation of L(G) on V by d(X) = X 

There is a particularly natural representation associated with any Lie algebra. 


Definition 32. Let L(G) be a Lie algebra. Then the adjoint representation is a represen- 
tation of L(G) over the vector space L(G), ad: L(G) —^ M(L(G)) defined by 


(ad v)w = [v, w] (2.114) 
for v,w € L(G). 
It is clear from the above that (ad v)w is linear in w , hence ad v € M(L(G)), and 


ad v is also linear in v. 
Moreover, if 1,79, w € L(G) then 


[v1, vo], w] 
V1, [v2, u]] — [v2, |i, w]] (using the Jacobi identity) 


(ad [v1, v9]) w 


-[ 
-[ 
ad vi)[vo, w] — (ad v2)[v1, w] 


x 
= (ad v1)(ad v2)w — (ad v2)(ad vi)w (2.115) 


so ad is indeed a representation. 


2.16 The Baker-Campbell-Hausdorff (BCH) Formula 


'The BCH formula states that the product of two exponentials can be written as an expo- 


nential: 


exp(v) exp(w) = exp(v + w + slow | E [v, |v, w]] + zio lu] +...) (2.116) 


where ... indicates terms of higher order in v and w. For simplicity we shall consider 


only matrix Lie groups, in which case the Lie algebra elements are square matrices. 


To obtain the first few terms in this formula, consider etet” as a function of t and set 


e70) — eet (2.117) 


At t = 0 we must have eZ) = I, which is solved by taking Z(0) = 0 (this solution is 
unique if we limit ourselves to the neighbourhood of the identity on which exp is invertible). 
Hence we can write Z(t) as a power series 


Z(t) 2tP 4 eQ + O(t?) (2.118) 


where we determine the matrices P and Q by expanding out (2.117) in powers of t: 


1 1 
I+t(v+w)+ at (we v? + 2uw) + O(9) 2 T1-- tP + 5¢ (Q -P?)--O(t) (2.119) 


from which we find P = v + w and Q = |v, w]. 
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Proposition 13. All higher order terms in the power series expansion of Z(t) in the BCH 
formula depend only on sums of compositions of commutators on v and w. 
Proof 


Suppose that Z(y) is an arbitrary square matrix. Consider 


O (.2Z(y) 
2 2.12 
flay) = a (799) (2.120) 
and 
foley) = f^ decim ZO oco, (2.121) 
l-r Oy 
These both satisfy 
Of; OZ(y), 
Bee = CEU 2200) + Z(y fiin v) (2.122) 


and f;(0,y) = 0 for i= 1,2. Hence fi (x,y) = falx, y). 
Now suppose that Z(t) is the matrix appearing in the BCH formula, i.e. 


e70 = etet" (2.123) 
Then consider the identity fi(1,t) = fo(1,t). This implies that 
1 
Z(t 
v + ewe ™ = | eva) PZE) Luz (2.124) 
0 
Now consider the function 


g(t) = e"we ™ (2.125) 
this satisfies g(0) = Y and 


ME e tv n —tv 2.12 
dme (ad v)"we (2.126) 


hence the power series expansion of g(t) is given by 


ewe = vr 
(2.1 


Applying this expression to both sides of 
finds 


t” 
als 


(ad v) (2.127) 


24) and performing the y-integral, one 


Ue YS (ad v)” -2 "Yee 200)" < (2.128) 


Then by expanding out Z(t) = X>] Znt” (as we know that Z(0) = 0), it follows by 
induction using the above equation that the Z,, can be written as sums of compositions of 
commutators on v and w. M 

Exercise: Suppose that [v, w] = 0. Show that e"e" = et”, 
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Proposition 14. Suppose that D is a representation of the matrix Lie group G acting on 
V. Then there is a representation d of C(G) also on V defined via 


d 
d(v) = = (D(exp(tv)))li-o (2.129) 
for v € L(G). 
Proof 
It is convenient to expand out up to O(t?) by 


D(e^) =1+td(v) + ?h(v) + O(£) (2.130) 


Note that as D is a representation we must have D(e(1*!3)v) = p(ehvyp(e!2v) 


Hence 


1+ (ty 4-13)d(v) + (t + t3)? h(v) + O(t2) = (12-tjd(v) + t2h(v))(14- ted(v) + t2h(v)) + O(t2) 
(2.131) 


and so on equating the titz coefficient we find h(v) = Id(v)?. 


2 
Next consider for v,w € L(G) 


D(e fog Metre) = (1 — td(v) + Led)? (1 — td(w) + Fedw) 


) ) 
x (1+ td(v) + Leao)? EE A f'dQo)) + O(£) 
=1+ P(d(v)d(w) — d(w)d(v)) + O(t?) (2.132) 


But using the BCH formula 


2 PEE 142 1 3 142 3 
ê tvo tw ctv tw _ e t(vd-w)- 3t^|v,w]--O(t )etlvtw)+5t [v,w]+O(t?) 


€ 
= ef vw] +0’) (2.133) 
and so 


Dete t et ett) = D(e wo) zs t?d((v, w]) + O(t?) (2.134) 
Comparing (2.132) with (2.134) we find that 


d([v, w]) = d(v)d(w) — d(w)d(v) (2.135) 


as required. 
To show that d is linear, suppose v, w € L(G) and a, B are constants. Then 


Diemer) = D(e'*”)D(e di 
= (1+ tad(v) + O(t?))(1 + t8d(w) + O(?)) 
= 14 t(od(v) + Bd(w)) + O(£) (2.136) 
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But by the BCH formula 


D(et gt8w) — D(etlart Bw) +0?) —1-4 td(av + Bw) + olt?) (2.137) 


Hence, comparing the O(t) terms in (2.136) and (2.137) it follows that d(av + Bw) = 
ad(v) + 8Bd(w). M 


Proposition 15. If G is a matrix Lie group, then the representation Ad : G — GL(L(G)) 
induces the representation ad : L(G) > M(L(G)). 
Proof 

If v,w € £(G) then 


Ad (e’)w = ewe” 
(1+ tv + O(t?))w(1 — tv + O(£)) 
w + t[v, w] + O(t?) 


= (I+ tad v + O(t?))w (2.138) 
and hence it follows that 
d tu 
qu e Jio = ad v (2.139) 


as required. M 
We have seen that a representation D of the matrix Lie group G acting on V gives rise 
to a representation d of the Lie algebra L(G) on V. A partial converse is true. 


Definition 33. Suppose that G is a matrix Lie group. Let d denote a representation of 
L(G) on V. Then a representation D is induced locally on G via 


D(g) =e (2.140) 


for those g € G such that g = e”. 

Here we assume that the representation d(v) is realized as a matrix linear transforma- 
tion on V, so that the standard matrix exponentiation e" may be taken. The represen- 
tation D induced by d is generally not globally well-defined, but it is locally well-defined 


on the neighbourhood of the identity on which exp is invertible. 


Proposition 16. The map D given in (2.140) which is locally induced by the representation 
d of C(G) on V defines a representation. 
Proof 

Clearly, D(g) defines a linear transformation on V. 

As I = e^ it follows that D(e) = e%) = e? = 1 where d(0) = 0 follows from the 
linearity of d. 


Also, suppose that g1, g2 have gı = €"!, g2 = e. Then by the BCH formula we have 


192 = gita lei va]... (2.141) 
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where ... denotes a sum of higher order nested commutators by Proposition 13. Hence 


D(g1g2) = edi bv 5 vi va]... ) 

ed) d(v2) - 5 4([v1,v2]) "C... ) (by the linearity of d) 
= ed (i) d(v2) 4 [d(v1),d(va)]+... (adie (2.113) 

= e401) e4(v2) 


D(g1)D(92) (2.142) 


So D is at least locally a representation. Note that we have made use of the fact that all 
higher order terms in the BCH expansion can be written as sums of commutators, together 
with the property (2.113) of representations of £(G) in proceeding from the second to the 
third line of the above equation. W 


Proposition 17. Suppose that G is a matrix Lie group. If D is a unitary representation 
of G on V then the induced representation d of L(G) on V is antihermitian. 

Conversely, suppose d is a antihermitian representation of L(G) on V, then the (lo- 
cally) induced representation D of G on V is unitary. 
Proof 

First suppose that D is a unitary representation. 

Recall that d satisfies D(e'*) = I + td( X) + O(t?) for t € R and X € L(G). 

As D is unitary it follows that D(e"*)D(e*)! =I. 

Hence (I + td( X) + O(2))(I + td(X) + O(t2))! = I, so expanding out, the O(t) terms 
imply d(X) + d( X)! — 0, i.e. d is antihermitian. 

Conversely, suppose that d is an antihermitian representation of L(G) on V. Let D 
denote the (locally) induced representation of G on V. 

Suppose that g € G is given by g = e* for X € L(G). Then 


D(g) = et) (2.143) 
Then 
D(gyD(g)! = e! 9 (ey! 
— gd QOO? 
E ed X e-4(X) 
=1 (2.144) 


Hence D(g) is unitary Ml. 
There are directly analogous definitions for irreducibility of representations of Lie al- 
gebras 


Definition 34. Suppose d is a representation of L(G) acting on V. A subspace W C V is 
called an invariant subspace if d(X)w € W for all X € L(G) andw € W. 


Definition 35. Suppose d is a representation of L(G) acting on V. Then d is reducible 
if there is an invariant subspace W of V with W Z 0 and W z V. If the only invariant 
subspaces of V are 0 and V then d is called irreducible. 
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Definition 36. A representation d of C(G) is called totally reducible if there exists a direct 
sum decomposition of V into subspaces W;, V = Wi Q W2 @ ... PW where the W; are 
invariant subspaces with respect to d and d restricted to Wi; is irreducible. 


Proposition 18. Suppose that G is a matriz Lie group. If D is a representation of G on V 
with invariant subspace W, then W is an invariant subspace of the induced representation 
d of L(G) on V 

Conversely, suppose d is a representation of L(G) on V with invariant subspace W; 
then W is an invariant subspace of the (locally) induced representation D of G on V. 
Proof 

Suppose that D is a representation of G on V with invariant subspace W with respect 
to D. Let d be the induced representation of £(G) on V. If w € W and X € L(G) then 


d(X)w = 5 (D(e'X)) ow = (D(X Jw) (2.145) 


As D(e'*)w € W for all t € R it follows that d(X)w € W. 

Conversely, suppose that d is a representation of L(G) on V, and W is an invariant 
subspace of V with respect to d. Let D be the locally defined representation of G induced 
by d. Then if g € G is given by g = e* for some X € L(G) then if w € W, 


D(g)w = 4 w = 5 TeXu (2.146) 
n=0 ^ 


However, as W is an invariant subspace of V with respect to d, it follows that d”(X)w € W 
for all n € N. Hence D(g)u c W. M 

Note that in this proof we made implicit use of the closure of W. 

'There is also a natural concept of equivalent representations of Lie algebras. 


Definition 37. Suppose d is a representation of L(G) acting on V1, and B : Vi > V is 
an invertible linear transformation. Define dX) = Bd(X)B-! for X € L(G). Then d and 
d are said to be equivalent representations. 

Exercise: Show that d defined above is a representation of L(G). Also show that if Dj, 
Də are equivalent representations of G on vector spaces Vi and V2 then the corresponding 
induced representations dı and dz on Vı and V2 are equivalent; and conversely, if dj and 
dz are equivalent representations of L(G) on V; and V2 then the locally defined induced 
representations Dı and D» of G are equivalent. 

Note that Schur's lemmas may be applied to representations of Lie algbras in exactly 
the same way as to representations of Lie groups. 

Hence we have shown that there is (at least locally) a 1-1 correspondence between 
irreducible representations of the Lie group G and the Lie algebra L(G). This is useful, 
because it enables us to map the analysis of representations of G to those of L(G), and 
thus the problem reduces to one of linear algebra. 


Proposition 19. Suppose that dı and dz are representations of L(G) acting on Vi and 
V2 and let V = Vi G9 V2. Define d = di & 1-- 1G də as a linear map on V. Then d is a 
representation of L(G) acting on V. 
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Proof 
If 4, we € L(V) and o, B are scalars and v1 & v2 € V then 


d(awı + Bw)v1 & v2 = di(aw + B'wa)vi Q v2 + vı Q do(ow + Bwa)va 
= (adi(w1) + Bdi(wa))vi Q v2 + v1 & (ada(w1) + Bdo(w»))vo 
ad,(w1)v1 Q v2 + Bdy(we)v1 ® v2 
avı Q do(w1)vs + Bv Q do(we)ve 
a(di(wi)vi & v2 + vj Q do(w1)va) 
B (Bd3(wa)vi & v2 + v4 Q d2(w2)v2) 
ad(w;)vi & v2 + Qd(wa)va Q vo (2.147) 


li sz 


I] + 


so d is linear on L(G). Also 


d([w1, we])v1 Q vg = dı ([w1, we])v1 ® vg + vida ([w; , w3]) vo 
= (di(w1)d1(w2) — di(w2)di(w1))v1 & vo 
+ vı Q (do(wi)da(w3) — do(wa)da(wi))vo 
= di(w1)d1(w2)vi Q v9 — di(wa)di(w1)vi Q v2 
+u g də(wı)d2(w2)v2 — V1 g də(w2)d2(w1)v2 (2.148) 


Also note that 


d(wi)d(w2)vi & vy = d(w1ı)(dı(w2)vı & v2 + v2 & d2(w2)v2) 
= dı(wı)dı(w2)v1 & v2 + di(we)v1 & d2(w1)v2 
+ dı(wı)vı Q do (w»)vo + V1 ® do (w3)do(w2)vo (2.149) 


where the sum of the second and third terms in this expression is symmetric in w and w2. 
Hence 


d(wi)d(wo)vi Q vg — d(wa)d(wi)vi Q vg = di (w1)di(w2)vi Q vg — di(w3)d4(w1)vi Q vg 
+ v1 Q d2(w1)d2(w2)v2 — vı Q do(wa)d2(u1)vo 
= d([wi, wa])vi & vo (2.150) 


as required W. 


Proposition 20. Suppose that Dı and Də are representations of matriz Lie group G on 
Vi and V2 with induced representations of L(G) on Vi and Vz denoted by dı and do. Let 
D = Di ®& D» denote the representation of G on the tensor product Vi Q) V2. Then the 
corresponding induced representation of L(G) on Vi &9 Vo is d 2 di & 12-16 da. 

Proof Suppose w € £(G), then expanding out in powers of t; 


D(ef")u, & v = D1(e")v; & Do(ef")us 
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= (vi + tdi(w)vi + O(t?)) Q (v2 + tdo(w)vo + O(t?)) 
= v1 @ vat t(di(w) vy G v3 + v1 ® d2(w) ve) + O(t?) 
= v & v» + t(dy @1 41 do)ui Q v2 + O(t?) (2.151) 


and hence from the O(t) term we find the induced representation d = di & 1 4- 16 d2 
as required Bl. 
2.17 The Killing Form and the Casimir Operator 


Definition 38. Suppose that G is a matriz Lie group with Lie algebra L(G). Then for 
X € L(G), ad X can be realized as a matrix linear transformation on L(G). The Killing 
form & is defined by 

K(X,Y) = Tr (ad Xad Y) (2.152) 


Suppose that Tą is a basis for L(G). Then from the Killing form one obtains a sym- 
metric matrix 
kao = ESSE) (2.153) 


Denote the matrix elements of ad T; by (ad T;),^; then note that 


(ad Ta) Ts = [as Tij = Can Te 
= (ad Ta) Te (2.154) 


Hence (ad Ta)5° = ca^. So it follows that 
Kab = Gi Coe? (2.155) 


Lemma 4. The Killing form is associative: K(X, [Y, Z]) = &(LX, Y], Z) 
Proof 


K(X, [Y, Z]) = Tr (ad Xad [Y, Z]) 
= Tr (ad X(ad Yad Z — ad Zad Y)) 
= Tr (ad Xad Yad Z) — Tr (ad Xad Zad Y) 
= Tr (ad Xad Yad Z) — Tr (ad Yad Xad Z) 
= Tr ((ad Xad Y — ad Yad X)ad Z) 
= Tr (ad [X, Y]ad Z) 
K([X, Y], Z) (2.156) 


as required. W. 

As Kap is symmetric, one can always choose an adapted basis for £(G) in which Kap is 
a diagonal matrix, and by rescaling the Lie algebra generators, the diagonal entries can be 
set to +1, 0 or —1. 


Definition 39. The Killing form is non-degenerate if Kab has no zero diagonal entries in 
the adapted basis. The Lie algebra L(G) is then called semi-simple. If all the diagonal 
entries are —1 then L(G) is said to be a compact Lie algebra. 
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Lemma 5. Suppose that L(G) is semi-simple. Define Cabe = ca kae (i.e. lower the last 
index of the structure constants with the Killing form). Then cay. is totally antisymmetric 


in a,b, c. 
Proof 
Note that 
K(Ta, [Eae Te) = Gs ACE Ta) = Cc Kad = Cbhca (2.157) 
and 
K([Ta, To], Te) = Cas (Ta, Te) = Cab Kde = Cabe = —Cbac (2.158) 


But by associativity of x, (2.157) and (2.158) are equal. Hence Cbca = —Cbac- AS Cabe 1S 
skew symmetric in both the first two and the last two indices, it follows that cap. is totally 
antisymmetric. M 


Definition 40. Suppose that C(G) is a Lie algebra with non-degenerate Killing form, and 
d is a representation of L(G) on V. The Casimir operator C of L(G) is defined by 


C = - M 1y?ha(r;)a(m) (2.159) 
a,b 


where &-! denotes the inverse of the Killing form. 
Proposition 21. The Casimir operator commutes with d( X) for all X € L(G) 


Proof 
It suffices to show that [C, d(Ta)] = 0 for all Ty. 


Note that 

[4(74), C] = — X (175) (ld(Ta), d(T) d(T-)]) 
ces Yu )" (ta(;), d(To)]d(Te) + (Ti) [d(Ta), d(T-)]) 
E 37 )"* (d[Za, Th]d(Te) + d(Ty)d[Ta, Tel) 
= 3 tybe (aus  d(Ty)d(Tz) + cas d (To)d(T?)) 
= etij — es d(T;)d(T;) 
- 0 (2.160) 

where we have defined 
PLN ED (2.161) 


and we note that c,"^ is antisymmetric in b,c. MI 
Note that if d is irreducible then by Schur's second lemma, C must be a scalar multiple 
of the identity. 
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If L(G) is compact, then working in the adapted basis, C takes a particularly simple 
form 


C= M a(T;)a(T;) (2.162) 
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3. SU(2) and Isospin 


3.1 Lie Algebras of SO(3) and SU(2) 


We have shown that the Lie algebra of SO(3) consists of the 3 x 3 antisymmetric real 
matrices. 


A basis for L(SO(3)) is given by 


0 0 0 0 0 1 Oi -1 0 
T=| 0 0 -1 T=| 0 0 T;,—| 1 0 Of (31) 
0 1 0 —1 0 0 i 0 0 


Exercise: Show that 
ee Tij = €abclc (3.2) 


where Cabe is totally antisymmetric and €123 = 1. 
Next consider SU(n). Let M(t) be a smooth curve in SU(n) with M(0) = I. Then 
M(t) must satisfy 
det M(t)=1 M(t)M(t)' =I (3.3) 


Differentiating these constraints we obtain 


_,dM(t) dM (t) dM(t)! 
Tr ( M(t)7" =0 M(t)! + M(t =0 3.4 
(uon 22) 7 mata mZ (34) 
Setting t = 0 we find 
Trm=0 m++mi=0 (3.5) 
where m = B) i-o. Hence £(SU (n)) consists of the traceless antihermitian matrices. 


Exercise: Verify that if X and Y are traceless antihermitian square matrices then so is 
[X, Y]. 
It is convenient to make use of the Pauli matrices c; defined by 


(PQ ed oe 


which satisfy ogo» = dapl + ?€apcoc. 
Then a basis of traceless antihermitian 2 x 2 matrices is given by taking Ta = — ioa. 
It follows that 


[os Tij = €abclc (3.7) 


Comparing (3.2) and (3.7) it is clear that SO(3) and SU(2) have the same Lie algebra. 
We might therefore expect SO(3) and SU(2) to be similar, at least near to the identity. 
We shall see that this is true. 

Exercise: Using this basis, show that the Lie algebra C(SU(2)) is of compact type. 
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3.2 Relationship between SO(3) and SU(2) 


Proposition 22. The manifold SU(2) can be identified with S?. 
Proof 
Consider 


D= | s r) € SU(2) (3.8) 
» 
for a, p, u, v € C. 
Then UU! — I implies that () be orthogonal to B with respect to the standard 
v 


«a. : : ; 
) in C? is a 1-dimensional 


inner product on C?. As the orthogonal complement to | B 


complex vector space spanned by F ) it follows that u = —o0 3, v = cà for c € C. 
à 


We also require that () and B have unit norm, which fixes 
» 


lc = le? +18? 2 1 (3.9) 


Finally, the constraint det U = 1 fixes o = 1. 


v=( a (3.10) 


where aà + 33 = 1. Such U is automatically an element of SU (2). Hence this is the form 
of the most generic element of SU (2). 


Hence we have shown that 


Set o = yo + iys, B = —y2 + iyı for yo, yi, Y2, ya € IR. Then it is straightforward to see 
that 
U = yol + tynon (3.11) 
and aa + 33 = 1 is equivalent to us +y? +y + ys =Lie-ges*: 
This establishes a smooth invertible map between SU(2) and S?. M 
Note that this map provides an explicit way of seeing that SU (2) is connected, because 
S? is connected. 


Proposition 23. There is a 2 — 1 correspondence R : SU(2) > SO(3) between SU(2) and 
SO(3). The map R is a group homomorphism. 
Proof Suppose U € SU(2). Define a 3 x 3 matrix R(U) via 


R(U)mn = 2m (OmUonU") (3.12) 


By writing U = yol+iymom for yo, ym € R satisfying yd --ypyp = 1, it is straightforward 
to show that 


Rmn = (ye EX Yo) Omin + 2€mnqYog + 2YmYn (3.13) 
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(here we have written Rmn = R(U)mn). 
It is clear that if yp = 0 for p = 1,2,3 so that U = +I, then R = I, so R € SO(3). 
More generally, suppose that Ypyp Æ 0. Then one can set yo = cos o, yy = SİN Zp for 


0 « o « 2 and o zZ s. Then the constraint y% + Upip = 1 implies that 252, = 1, i.e z is a 
unit vector in R3. The expression (3.13) can be rewritten as 
Rmn = COS 200 pay Sin 20654425 + (1 — 60820 85725 (3.14) 


It is then apparent that 


Rz mE (3.15) 


and if x is orthogonal to z then 


Rig Pip = COS 20954 Sn 20€ uana (3.16) 


The transformation R therefore corresponds to a rotation of angle 2a in the plane with 
unit normal vector z. 

It is clear that any non-trivial rotation in SO(3) can be written in this fashion. How- 
ever, the correspondence is not 1 — 1. To see this explicitly, suppose that two rotations 
corresponding to y € S? and u € S? are equal. Then 


(ye = YpYp)ðmn + Z€mnqV0Uq + 2YmYn = (u = UpUp)Omn + 2€mnqoug + 2UmUn (3.17) 


where Ya + YpYp = us + UpUp = 1. 
From the antisymmetric portion of this matrix we find yoy, = uot. 
From the diagonal elements with n = m, we see that 


Vo — YpYp + 2(ys)^ = uj — UpUp + 2(Um)? (3.18) 
where p is summed over but m is fixed. Summing over m we find 
3yó — YpYp = 3ug — Up Up (3.19) 


where p is summed over; which together with Và + YpYp = ug + UpUy = 1 implies that 
yê = ug and YpYp = UpUp (sum over p). Substituting back into (3.18) we find y2, = u2, for 
each m = 1,2,3. 

Suppose first that yo 4 0, then uo 4 0, and it follows that yo = tuo and yp = cu, for 


each p — 1,2,3 (with the same sign throughout). 

Next, suppose yo = 0. Then up = 0 also, and yg ys = UmUn for each m,n = 1,2,3. 
Contracting with yn we get (ynyn)Um = (YnUn)Um for m = 1,2,3. As YnYn = 1, this implies 
that Ym = NUm for m = 1,2,3 where A is constant. Hence, (1 — A?)umun = 0. Contracting 


over m and n then gives 1 — A? = 0, so A = +1. Therefore y, = cu, for p = 1,2,3 (with 
the same sign throughout). 
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Hence, we have shown that each R € SO(3) corresponds to two elements U and 


—U € SU(2). These two elements correspond to antipodal points +y € S?. This establishes 
the correspondence. 
It remains to check that R(U1U2) = R(Uj)R(Us) for U1, U2 € SU(2). Note that on 
writing 
Ui; = yol2 + iynon, | Uo — wo +iwnon (3.20) 


for yo, yp, W0, Wp € R satisfying Va + YpYp = we + WpWp = 1 then 
U {U2 = uole + iunon (3.21) 


where uo = yowo — YpWp and Um = YoWm + WoYm — EmpqYpWq Satisfy ue + UpUp = 1. It then 
suffices to evaluate directly 


and compare this with 


R(Ui)mpR(U2)pn = [(yô — Yeye) Smp+2€mpqYoYq+2YmYp| [(wg — wy, Ww) Opn +2€pnrWoOWr -2wpw; | 

(3.23) 
On expanding out these two expressions in terms of y and w it becomes an exercise in 
algebra to show that R(U1U2)mn = R(U1)mpR(U2)pn as required. M 

Exercise: Verify the identity R(U1U2)mn = R(U1)mpR(U2)pn. 

It is conventional to write SU(2) = S? and SO(3) = S?/Z»5, where S?/Z»5 is the 3- 
sphere with antipodal points identified. SU(2) is called the double cover of SO(3); and 
there is an isomorphism SO(3) = SU(2)/Z3. 

It can be shown (using topological methods outside the scope of this course) that 
SU(2) and SO(3) are not homeomorphic. This is because SU(2) and SO(3) have different 
fundamental groups 7,. In particular, as SU(2) & S, and SS? is simply connected, it 
follows that 71(SU(2)) is trivial. However, it can be shown that 71(SO(3)) = Za. 


3.3 Irreducible Representations of SU(2) 


The standard basis of C(SU(2)) which we have been using is T; = —2o4. Suppose that 
d is a finite-dimensional irreducible representation of £(SU(2)) on V, where V is a vector 
space over C. 

Define 


Ja —id(Ts, J = —A(d(y) + id(T5)) (3.24) 


Then 


[J3, J+] = +J4, [J}, J-] = J3 (3.25) 


As V is a complex vector space, there exists an eigenstate |¢) of J3 with some eigenvalue 
A, and using (3.25) it follows that 
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J3J+ |b) = (^ € 1)J« |o) (3.26) 


and so by simple induction 


Ja(J)" |) = (A n)(Jx)" |Ø) (3.27) 


for non-negative integer n; ie. J+ are the standard raising and lowering operators. 
The eigenvalues of J4 are called weights. We have therefore shown that (J+)” |o) either 
vanishes or is a J3 eigenstate with eigenvalue A + n. 


Consider the states (J,)”"|¢). If non-vanishing, these states are linearly independent 
(as they have different J3 eigenvalues). Hence, as V is finite dimensional, there exists a J3 
eigenstate (J.,.)" |¢) which we will refer to as |j, 7) with eigenvalue j such that J4 |j, j) = 0. 


Definition 41. j is called the highest weight of the representation. In the context of particle 
physics, it is called the spin. 

Note that by acting on |j, j) with J_ other distinct eigenstates of Jz are obtained. As 
we are interested in finite dimensional representations, it follows that (J_)% |j, j} = 0 for 
some positive integer N (otherwise one could just keep going and the representation would 
be infinite dimensional). Let k +1 be the smallest positive integer for which this happens, 
and set |Y) = (J..)* |j, j), so, by construction, J- |y) = 0. 

Define for £ = 0,...,k 


Ivo = (4-Y* 15, 3) (3.28) 
Then |v;) are (non-vanishing) j — £ eigenvectors of J3. 
Note that 
J4 bo) = J4(LY lj j 
GI) ad) 
(Fit ESO QE T 3) 
= (Jat J-J (Y 03,5) 
SG- CDE 5, 3) + J-J4 (0) 71 (4,3) (3.29) 


Repeating this process, one finds by induction 


J4 vo) = OM HOA FU MU |Ui-1) 
= lj- Piu — 1)) [Vi-1) (3.30) 


In order to constrain j recall that J_ |j) = 0, so 


0 — J4J-— |i) 
= ([J4, J-] + J-J+) |n) 
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= (J3 + Jod.) |) 


= ( — E) fda) + J- (EG — (k — 2) les) 
= (j — k+ KG — Z (k 1) lb) 
= 50 DG - 5) lun) (3.31) 


As k is non-negative, it follows that k = 27 
Using this it is straightforward to prove the following 


Proposition 24. V = span{J* |j, j), JET! |j, j) ,...,|j,7)} and the highest weight state is 
unique. 
Proof 

Consider the vector space V’ spanned by |v;) for i = 0,...,k. This is an invariant 


subspace of V with respect to the representation d. As the representation is irreducible on 
V it follows that V = V'. In particular, J3 is diagonalizable on V and each eigenspace is 
1-dimensional. 

To prove uniqueness suppose that |¢) € V satsfies J4 |o) = 0. As |o) € V it follows 


that we can write 
2j 


16) = M ai(4-Y 1,3) (3.32) 
i=0 
for constants a;. Applying (J)?! to both sides of this equation implies ao; = 0. Then, 
applying (J;)?/~1 implies a2;-1 = 0. Continuing in this way, we obtain aj = a3 = =- = 
42j;—1 = a2; = 0, and so |ó) = ao|j,j). So the highest weight state in an irreducible 
representation of C(SU(2)) is unique (up to scaling). W. 

Hence, we have shown that j is half (non-negative) integer, and the representation is 
2j + 1-dimensional. The irreducible representations are therefore completely characterized 
by the value of the weight j. 

It is possible to go further, and prove the following theorem (the proof given is that 
presented in [Samelson]): 


Theorem 1. **NON-EXAMINABLE** Suppose that d is a representation of C(SU(2)) 
on a complex vector space V. Then V can be decomposed as V = Vi ®--- ® V, where V; 
are invariant subspaces of V such that d restricted to V; is irreducible. 
Proof Given in Appendix A. 

Note that we have not as yet assumed that the representation originates from a unitary 
representation of the Lie group. However, in order to compute normalized states, it will 
be convenient to assume this; so in particular, the d(T;) are antihermitian and hence 


=, JÅ = J (3.33) 


We will assume that the highest weight state |j, 7) is normalized to 1. 


Exercise: 
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Show using (3.30) that (e| vy) = §(27 — £+ 1) (ii| Ye-1), and hence that 


Ne = (Wel Ye) = sg 


It is conventional to define normalized states |j, m) for m = —j,...,j via 


1 
|j, m) = ——— Iij-m) 


VN 


(3.34) 


(3.35) 


the first label denotes the highest weight value j, the m label denotes the eigenstate 


of Js, J3 |j, m) = m |j, m). These satisfy (check!) 


Lm) = VG mG —m+1)|j,m—1) 


2 
l JG+mG—m +1) |j,m) 


v2 


J4 |j,m—1) = 


(3.36) 


Exercise: Show that the Casimir operator C is given by C = —$(J4J_+J_J4.+(J3)”) 


and satisfies 


. l4. ; 
C |j, m) = -340 t 1) |j, m) 


3.3.1 Examples of Low Dimensional Irreducible Representations 


It is useful to consider several low-dimensional representations. 


ES I. A normalized basis is given by [5 1) and |, —3), with 


11 
Ji l=,-= 0 
poe 1 1 1 
J. = 
+15 2 Ji la»! 
and 
11 1 1 1 
L3 77g 
1 1 
J_|=,-=) 20 
l5. -5) 


ii) j= 1. A normalized basis of states is |1, 1), |1,0) and |1, —1) with 


J,|1,1) 20 
J,|1,0) = |1,1) 
J+ |1, 21) z |1,0) 
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(3.37) 


(3.38) 


(3.39) 


(3.40) 


and 


J_|1,1) = |1,0) 
FN 0) 1 0) 
J_|1,-1) =0 (3.41) 


iii) j = 3. A normalized basis of states is |3, 3), |2, 5), |3, — 3) and |3, —3). with 


^ py 3,3 3 
ea Hr 
Tlg73) = Va larg? (92) 
and 
3 3 3,31 
A M 22 
7T- Ig! = V2l5.-3) 
Lipa 515,5) 
L|5,-2) =0 (3.43) 


3.4 Tensor Product Representations 


Suppose that dı and d» are two irreducible representations of C(SU(2)) over vector spaces 
V(1), V(2). Let V = V(1) G9V (2) be the tensor product, and d = dı & 1 + 1G do be the 
representation on V. 

We wish to decompose V into irreducible representations of d (i.e. invariant subspaces 
of V on which the restriction of d is irreducible). 

Denote the states of V(1) by |j1, m) for m = —j1,...,j1 and those of V(2) by |j2,n) 
for n = —j»,...,j2, where jı and jz are the highest weights of V(1) and V(2) respectively. 
Note that |j1, m) Q |jo, n) for m = —j1,..., j1 and n = —jo,...,J2 provides a basis for V. 

Set 


J()-id(m),  J()- Jg C) + id (T5) (3.44) 
and . 
Ja(2) = ido(T;), | Ja(2)— zen) aTa (3.45) 
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with 
Ja = J3(1) 8&1 -- 1 J3(2), Jz = J4(1) 81-16 J4(2) (3.46) 
Exercise: Check that J+ and J3 satisfy J = Ja, Ji = J+ and 


[/ Ja] 2 £J4, [Jy,J_]= J (3.47) 


In order to construct the decomposition of V into irreducible representations, first note 
that if |»(1)) € V(1) is a state of weight m4 with respect to J3(1), and |v(2)) € V(2) is 
a state of weight m» with respect to J3(2) then |v(1)) & |v(2)) € V is a state of weight 
mı +m, with respect to J3, i.e. weights add in the tensor product representation. 

Using this, it is possible to compute the degeneracy of certain weight states in the 
tensor product representation. In particular, the maximum possible weight must be jı + j2 
which corresponds to |j1, j1) 9 |j1, j2)- 

Consider the weight 7; + j2 — k for k > 0. In general, jı + jg — k can be written as a 
sum of two integers m4 + m5, for mj € (—j1,..., 1) and ma € {— j2, ..., j2} in k +1 ways: 


jitje—k = (ji-—k) +e 
= (j1 -k+1)+(jo-1) 


mesa 
= ji + (jo—k) (3.48) 


provided that jı — k > —jı and jo — k > —j2, or equivalently 


k < 2min(j1, j2) = ji + j2 — | — Jal (3.49) 

Consider the state of weight jı + j2. There is only one such state, and there is no state 
of higher weight. Hence it must be a highest weight state of a subspace of V on which 
the tensor product representation is irreducible. This irreducible subspace has dimension 
2(j1 + j2) + 1, and is denoted by Vj+j Write V = V' @ Vj, cis. 

Next consider the states in V’. The highest possible weight is jı + j2 — 1. In V there 
were two linearly independent states of this weight, however one of these lies in V;,+;,, and 
does not lie in V’. The remaining state of weight jı + j2 — 1 is of highest weight in V’ and 
hence must be a highest weight state of a subspace (of dimension 2(j; + j2) — 1) of V’, on 
which the tensor product representation is irreducible. Denote this subspace by Vj +j2-1- 
Note that by construction Vj,+j. N Vj,+j.-1 = (0). 

One can continue inductively in this fashion: for each j = |j1—j2|.|j31—J2|- 1... gi tJe 
there is a subspace V; (of dimension 274-1) on which the tensor product representation is ir- 
reducible, with highest weight j, and V;j'YV; = {0} if j A k. In fact these subspaces account 
for all of the elements in V. To see this, compute the dimension of V, (D QQ Vis: 

We find 


jitje 


dim Vii jal p RN P Vitj = 5 (25 + 1) 


j-|-—32l 
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jitje—|j1—Jja| 
= MP (Als - gol +n) +1) 


n=0 
(1 + 291)(1 + 252) 
= dim V (3.50) 


Hence we have decomposed V = V, 4 D- (D Vj Lj, into irreducible subspaces V; 
where V; has highest weight j and is of dimension 27 + 1. 


3.4.1 Examples of Tensor Product Decompositions 


We shall consider two simple spin combinations which are of physical interest. 

Firstly, take the spin 1/2 & spin 1/2 composite system. As jı = jo = 5 there are two 
possible values for the composite spin, 7 = 1 or j = 0, and the tensor product space is 
4-dimensional. 


For the j = 1 states, the highest weight state can be written as 


11 11 
1,1) =J=,= Am 3.51 
IMEIDPTIDES (3.51) 


Applying J_ to both sides we find 


1,1 1 11 11 1 1 


1 = | A ; 3.52 
l9) = a (Igo) @ lpg) + Ig) @ lg 73) (3.52) 
and applying J_ once more 
1 1 1 1 
1,-1) = 3.53 
It.) = 155-5) € I5. 73) (3.53) 


The remaining possible spin is 7 = 0. This has only one state, which must have the 
form 


1 
2 
for constants co, c1 to be fixed. Applying J, to both sides we get 


11 1 1 
= H 3 3.54 
10,0) — o5, 5) 815, 75) tals ) (3.54) 


1 1 1 

=0 3.55 
(co +1) 565) 814,5) (3.55) 

so cy = —co. Then on making the appropriate normalization we find 

1 1 1 11 11 1 1 

0,0) — ; ; 3.56 
0,0) = a (I5. 75) 9l) I gH 8g 3) (3.56) 
Next, take the spin 1 & spin 1/2 composite system. As jı = 1, j2 = i there are two 
possible values for the composite spin, j — i or j = 3, and the tensor product space is 


6-dimensional. 


For the j = 3 states, the state with greatest weight is 


3 3 11 
2 2) = 1,1) @|=,- ; 
55) = 11) @ 1555) (3.57) 
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Applying J_ to both sides gives 


3 1 2 pp. 1 1 1l 
c ym ubt 
loss! "H 0) 85,3) * el »1) @|5,—5) (3.58) 


Further applications of J_ give 


3 1 1 11 2 1 1 
= 1,—1 | 1, ap ; 
lj 7g) = gl -0 els a) eke] (3.59) 
and 
3 3 1 1 
rcc el eee eg e ; 
Pe (3.60) 


For the j = i states, the state with maximal weight can be written as a linear combi- 


nation 
I2) = olhei- i0) ® I>, 2) (3.61) 
2°92 = & |4, PE 2 cj |4, 2°92 x 
for some constants co, c1 to be determined. Then 
11 1 11 
elg) = (r+ 759) 10 @ 1505) (3.62) 
so € = EN On making a unit normalization, we also fix co and find 
11 2 1 1 1 11 
E spec pde edere = 3.63 
In --V2n nell. 5) + EDS (3.63) 


and applying J_ to both sides this gives 


1 1 1 1 1 2 11 
e PANI ' jd 64 
ls 73) = gl els 3) /in-neidn (3.64) 


3.5 SU(2) weight diagrams 


One can plot the weights of an irreducible C(SU(2)) representation on the real line; for 
example: 


The spin j24 SU(2) irreducible representation 


--— J @ @ @ @ @ @ ©@ 
6 -5 -4 -3 2 -1 0 1 2 3 4 


The weight diagrams of irreducible representations have the following properties 


i) The diagram is reflection symmetric about the origin. 


ii) The weights are all half integer; and are distributed with unit distance between each 
weight. The highest weight is j for 27 € N, and the lowest weight is —j, and there 
are no “holes” in the weight diagram- —j,—j + 1,...,j — 1, j are all weights. 
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ii) Each weight has multiplicity 1. 


One can also plot the weight diagram of a generic (not necessarily irreducible) represen- 
tation. For example, the weight diagram of the tensor product (jı = 1)e (ja = 3) (Ja = 3) 
eight dimensional tensor product representation is 


| | + - | ve o0 @ A 3A. 
-3 -2 -1 0 1 2 3 


This has a highest weight j = 3 and a lowest weight -3 both with multiplicity 1, and 


weights +4 each with multiplicity 3. In general, a non-irreducible representation will have 


a highest weight, but it need not be of multiplicity 1. For a generic weight diagram 


i) The diagram (together with weight multiplicities) is reflection symmetric about the 
origin. 


ii) The weights are all half-integer. 


ii) Each weight need not be of multiplicity 1. However, as one proceeds from a particular 
weight towards the origin (in unit steps from either the left or the right), the weight 
multiplicities do not decrease. 

3.6 SU(2) in Particle Physics 
3.6.1 Angular Momentum 


The orbital angular momentum operators La acting on wavefunctions are given by 


. o 
L,-— itai 5z (3.65) 
'These operators satisfy 
|La, Li = labo Lc (3.66) 


and hence correspond to a (complexified) representation of SU(2). Particles also carry a 
spin angular momentum S, which commutes with the orbital angular momentum [L, S] = 0. 
The total angular momentum is defined by J = L + S. 


3.6.2 Isospin Symmetry 


It is observed that the proton and neutron have similar mass, and also that the strong 
nuclear forces between nucleons are similar. Heisenberg introduced the concept of a SU(2) 
isospin symmetry to systematize this. Particles are grouped into multiplets of isospin 
value I (previously called j) and labelled by the weights, which are the eigenvalues of I3. 
Originally, this was formulated for nucleons, but later extended to describe all mesons and 
baryons. 

Particles in the same isospin multiplet have the same baryon number, the same content 
of non-light quarks, the same spin and parity and almost the same mass. Isospin is a 
conserved quantum number in all known processes involving only strong interactions: it is 
related to the quark content by 
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I = ENU) - N(u) — (N(d) - N(d))) (3.67) 


Isospin symmetry arises in the quark model because of the very similar properties of 
the u and d quarks. 
Examples: 


i) Nucleons have isospin IJ = 53 the proton has I3 = i, and the neutron has /3 = -i 


1 1 
n PE. 
BS lg: z (3-68) 
ii) The pions have I = 1 with 
7 = |l,—1) 
T? = |1,0) 
at = |1,1) (3.69) 


iii) The strange baryons have J = 0 and I = 1 


5 = |1,—1) 
E° = |1,0) 
Src DS. (3.70) 
and 
A? = |0, 0) (3.71) 


iv) The strange mesons lie in two multiplets of J = i 


kispe 
i 

KY i1 A2 
zz) (3.72) 

and 

= 1 1 

K ERr. 

= 11 

K? -|-,— ; 
zz) (3.73) 


K= are antiparticles with the same mass, but are in different isospin multiplets 
because of their differing quark content. 


—060- 


v) The light quarks have J = i 


a 
II 

l 
NI =| 
~~" 


g 
I 


(3.74) 


NI BD] = 
NI = 
rr ec 


and all other quarks are isospin singlets J = 0. 


3.6.3 Pauli’s Generalized Exclusion Principle and the Deuteron 


Consider first NN nucleon-nucleon bound states. From the previously obtained decompo- 
sition of the (Iı = 1/2) ® (I2 = 1/2) tensor product we find the following isospin states 


1 
|1,1)=pp, |1,0) = wr: |1,-1) = nn (3.75) 


which are are symmetric under exchange of isospin degrees of freedom, and the remaining 
state is 


0,0) = UL - pn) (3.76) 


which is anti-symmetric under exchange of isospin degrees of freedom. 

The deuteron d is a pn bound state, which has no pp or nn partners. There is therefore 
only one possibility; d — |0, 0). 

In general, the total wavefunction for a NN state can be written as a product of space, 
spin, and isospin functions 


w = v(space)v(spin)v(isospin) (3.77) 


The generalized Pauli exclusion principle requires allowed NN wave-functions be anti- 
symmetric under exchange of all degrees of freedom. As w(isospin) is anti-symmetric, 
V(space)v (spin) must be symmetric- in fact (space) is symmetric (the nucleons in the d 
are in a £ = 0 angular momentum state), and v(spin) as also symmetric, as d is a spin 1 
particle. 


3.6.4 Pion-Nucleon Scattering and Resonances 


Isospin can be used to investigate scattering processes; consider for example pion-nucleon 
scattering processes. From the decomposition of the (Ij = 1) & (Ig = 1/2) tensor product 
we find 
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—2.)-T7Tn (3.78) 


and 


1 1 PN 
Ia) = "E p— yg (3.79) 


These equations can be inverted to give 


33 
mp = |.) 
gode. leah A 
0, _ 
p= beh 
jud 211 
wn = (3,4) - als.) 
3 
pe 1 1, fF 4, 
MRE TI 39-9 
2p ga. rap equ. cd 
0, 
T n = 3 l2 5! B? 2 
ES. 


Consider mN scattering. The scattering is described by a S-matrix, which, in processes 
dominated by strong interactions is taken to be isospin invariant: [J;,S] = 0 and so by 
Schur's lemma 


(I, m/| S |Z, m) = O(L) 677mm! (3.81) 
The cross sections of the scattering processes are given by 
o(in > out) = K| (in| S |out) |? (3.82) 
for constant K. Hence 
+ + 32 
o(n*p > x*p) = KÈ) (3.83) 
- 2 3 1 

(n?n — «7p) = KIG(5) - 665)? (3.84) 

2 E 1 3 Is 
o(p — 279) = SKIAS) + 24) (3.85) 


For all three of these processes, a marked resonance is measured at approximately 1236 
Mev. The ratio of the cross-sections of these resonances is 


o(ntp— atp):o(n°n > ap): 0(m pomp) =1: 


(3.86) 


which is consistent with the supposition that the resonance corresponds to a particle of 
isospin 3 (and so |¢(3)| > |¢(%)|). This particle is the A particle, which lies in an 


isospin J = 3 multiplet with states A~, A9, At, A** having weights Iz = —3, 


—118 
20r 202 


respectively. 
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3.7 The semi-simplicity of (complexified) C(SU (n + 1)) 


To proceed, define the n+ 1 x n+ 1 square matrices Ej; by (Ei j)pa = óipójg. Recall that 
LE (SU (n 4-1)) consists of the traceless antihermitian matrices, and is therefore spanned over 
R by i(Eii— Epping) for i = 1,...,n and By — Eji, i(Eij + Eji) for? € j. Hence, on 
complexification, the Lie algebra is spanned by traceless diagonal matrices together with 
the Ei j for i z Jj. 

Suppose that h = diag (a1, a2,...,an+1), >a; = 0 is a traceless diagonal matrix. 
Then if i Æ j, observe that [h, Fij] = (a; — aj) Ej j. 


Proposition 25. Complezified C(SU (n 4- 1)) is semi-simple. 
Proof 
Note by direct computation that 


[Ei j, Ers] = 9j Eis — fis Er. (3.87) 


so 
(ad Eng ad Bi Es = 05r0iqEp,s ED Ojr Ops Eiq = Ôisôqr Ep j + 0isOpj Lrg (3.88) 


and hence the component of (ad Ej, ad Ei j)Er, s in the Eeg direction is 
0 jrÓjgÓptÓst — O5, ÓpsOitÓqt — SisOgrOpe;t + OispjOrsÓqt (3.89) 
There are various cases to consider 


i) K(Epq, Ei) for p Z q, i # j. We must compute Tr (ad Ep, ad Ej;). If r A s then 
the component of (ad Ep q ad E;;)E,,in the E, s direction is 


05r0iqOpr = ÒjrÔpsÔirÔqs EE OisÓqrÓprÓjs + 6is0njOgs = 05r0igOpr + bisOpjOqs (3.90) 


(as i Z j and p Z q). So the contribution to the trace from these terms is 


n+1 
S bjr5igSpr + 5isdpj5qs = 2nóisóp; (3.91) 
r—l,s—l,rzs 


We also compute the component of (ad Ej, ad Ej;)E,. in the direction Ej; — 
En+in+1 for k = 1,...,n. Observe that the component of ój; Ej, — óps Ej; in this 
direction is digdps(pk — diz) (if i A q or p # s then the diagonal components of 
ig E, —ó, Ej; vanish). Hence the component of (ad Ep, ad E; j)Er,s in the direction 
Ekk — Eien for k = 1,...,n is 


05r0iqOps (Opi z dik) 2 ÔisÔpjÔrq(Ôrk = Onk) (3.92) 
It follows that the component of (ad Ep, ad E;;)(Erg — Entin+i) along (Ek k — 


Ee ed) is 


0 jk 0iqOpk (Opk — Sik) + OikOpj5gk(1 — Spx) 
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iii) 


—(5}n+15ig5pn+1 (pk — 0; ) E 0 nr lOpiogni1ópk) (3.93) 


which can be simplified using i Z j and p Z q to 


ÓjkOigÓpk + Sik OpjSgk + 9j,n419igSpn+19ik + 5in-+19pj5q,n+19pk (3.94) 
On taking the sum from k = 1,...,n this gives a contribution to the trace of 
2ôiqûpj (3.95) 
and so 
K(Ep,a, Eig) = 2(n + 1)digdp; (3.96) 


k(Epq h) for h = diag (a1,a2,...,a541) with »5,a; = 0 and p # q. The only 
contribution to the trace Tr (ad E, ad h) comes from terms 


(ad Ej, ad h) Ei; = ad Ep.q(ai — aj) Eig = (ai — aj)(giEp — Ôp j Big) (3-97) 
for i # j. The component of this matrix along the E;,; direction is 
(a; = aj) (Ógiópi m 055054) —0 (3.98) 


as p Z q. Hence 


k(Epg h) —0 (3.99) 


K(h, g) where h = diag (a1,a2,...,a541) and g = diag (bi, b2,...,bn41) and 3; a; = 
Jibi = 0, 


The only contribution to the trace Tr (ad h ad g) is from the terms 


(ad h ad g) Ei; (3.100) 
for i Æ j. But 
(ad h ad g)Eij = (ai = aj)(bi = DES (3.101) 
so taking the sum over i and j ( Z j) we obtain 
&(h, g) = 2(n + 1) ‘Ss aibi (3.102) 
i 
Hence « is negative definite over the span over R of i( Err — En4in41) for r = 1,...,m; 


and this span is orthogonal to the span of the E;; — Ej; and i(E;; + Eji) (à Æ j). 
Furthermore, & is diagonal and negative definite over the span over R of the E; j — Ej; and 


i(Ei,4- Eji). k is therefore non-degenerate. MI 


We also have the immediate corollory: 


Corollory 2. The Lie algebra of SU (n -- 1) (as a real Lie algebra) is compact. 
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4. SU(3) and the Quark Model 


The Lie algebra of SU(3) consists of the traceless antihermitian 3 x 3 complex matrices. 


It is convenient to define the following matrices 


1 
> 0 0 2y8 
m=] 0 -i 0 hy = 0 
0 0 0 0 
1 
1 k v2 : 1 ) 
e4—-|0 0 0 ge um 
0 0 0 0 
0 0 0 0 
2 1 2 
e} = 0 0 V8 e = 0 
0 0 0 0 
1 
: 0 0 zz : 0 
ej = 0 0 0 e -— 0 
1 
0 0 0 ND. 


0 0 
LC 
0 0 
0 0 
0 0 
0 0 
0 0 
a 0 
0 0 
0 0 (4.1) 
0 0 


Then ihi, ihz and i(e'? -- e"), e'? — e" for m = 1,2,3 form a basis for the antihermitian 
traceless 3 x 3 matrices (over R), and hence are a basis for £(SU(3)). Suppose that d is 
the irreducible representation of C(SU(3)) acting on a complex vector space V which is 
induced from an irreducible representation of SU(3) acting on V. 


It is convenient to set 


Hı = d(hi), Hə = d(h2), EX det) for m = 1,2,3 (4.2) 


Then we find the following commutators: 


[H3 Hə] = 0 i 
[H;, EX] z +E}, [Hi, E$] = F Ei. 
[H», Ei] = 0, [H5, E. ] = aV gg. 
and 
[EL,E!] = Hi 
3 1 
[E?, E?] = L E 
3 1 
[E}, E3] = Nip + =H, 
2 2 
The remaining commutation relations are 
1 
[E], E?] = —E?,  [EL,E?]— 


v2 
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|n, E$] = +5 Et 
[H5, E] = "I (4.3) 
(4.4) 
1 
WE 


1 1 
[ES E’] NL [E], E3] = yg 
[Es E3] = age (ee CEOs WE (4.5) 
and 
[Fy E?] — [E2, F3] = (FL, E3] = [ET E] = [E2, E3] = [E2, E3] — 0 (4.6) 


Note in particular that Hı, Hə commute. The subalgebra of £(SU(3)) spanned by 
ih; and ih» is called the Cartan subalgebra. It is the maximal commuting subalgebra of 


£(SU(3)). 


4.1 Raising and Lowering Operators: The Weight Diagram 


The Lie algebra of £(SU(3)) can be used to obtain three sets of C(SU(2)) algebras. In 
particular, we find that 


[H;, E1] = € ET, E}, Et] = H; (4.7) 
and i 5 
3 1 3 1 
[— He — -CHi, E2] = € E?, E*,E?]| = H — -Hı (4.8) 
2 2 2 2 
and 5 
3 1 3 1 
Sin + 5M, ES] = EE, BS ES|- s Hk (4.9) 


In particular, there are three pairs of raising and lowering operators E7". 


For simplicity, consider a representation d of C(SU(3)) obtained from a unitary repre- 
sentation D of SU(3) such that d is an anti-hermitian representation- so that Hı and Hə 


are hermitian, and hence diagonalizable with real eigenvalues. Hence, Hı and V3 A. o+im, 
can be simultaneously diagonalized, and the eigenvalues are real. (In fact the same can be 
shown without assuming unitarity!) 

Suppose then that |) is an eigenstate of Hı with eigenvalue p and also an eigenstate 
of H> with eigenvalue q. It is convenient to order the eigenvalues as points in R? with 
position vectors (p,q) where p is the eigenvalue of Hı and q of Hə. (p,q) is then referred 
to as a weight. 

From the commutation relations we have the following properties 


i) Either El |d) = 0 or Et |) is an eigenstate of H1 and Hə with eigenvalue (p, q)+(1, 0) 


ii) Either EZ |ó) = 0 or E2 |) is an eigenstate with eigenvalue (p, q) + (—3, X3) 
iii) Either E2.|¢) = 0 or Ei |ó) is an eigenstate with eigenvalue (p, q) + (4. X3) 


Moreover, from the properties of C(SU(2)) representations we know that 


2p- mi, v3q—-p-m», V3q+p= ms (4.10) 
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for mi, m2, ma € Z. It follows that 2v3q € Z. It is particularly useful to plot the 
sets of eigenvalues (p, q) as points in the plane. The resulting plot is known as the weight 
diagram. As the representation is assumed to be irreducible, there can only be finitely 
many points on the weight diagram, though it is possible that a particular weight may 
correspond to more than one state. Moreover, as 2p € Z, 2\/3q € Z, the weights are 
constrained to lie on the points of a lattice. From the effect of the raising and lowering 
operators on the eigenvalues, it is straightforward to see that this lattice is formed by the 
tessalation of the plane by equilateral triangles of side 1. This is illustrated in Figure 1, 
where the effect of the raising and lowering operators is given (in this diagram (0,0) is a 
weight, though this need not be the case generically). 


The weight diagram has three axes of symmetry. To see this, recall that if m is a weight 
of a state in an irreducible representation of C(SU(2)) then so is —m. In the context of 
the three Z(SU(2)) algebras contained in Z(SU(3)) this means that from the properties 
of the algebra in (4.7), if (p,q) is a weight then so is (—p, q), i.e. the diagram is reflection 
symmetric about the line 0 — 5 passing through the origin. Also, due to the symmetry of 
the £(SU(2)) algebra in (4.8), the weight diagram is reflection symmetric about the line 
0 = & passing through the origin: so if (p, q) is a weight then so is (3(p4- V3q), 3(V3p— q)). 
And due to the symmetry of the £(SU(2)) algebra in ((4.9) the weight diagram is reflection 
symmetric about the line 0 — = passing through the origin: so if (p,q) is a weight then so 
is ($(p — v34), $(—V3p — q)). 


Using this symmetry, it suffices to know the structure of the weight diagram in the 


sector of the plane § < 0 < 5. The remainder is fixed by the reflection symmetry. 
Motivated by the treatment of SU(2) we make the definition: 


Definition 42. |Y) is called a highest weight state if |Y) is an eigenstate of both Hy and 
Ho, and E? |y) = 0 for m = 1,2,3. 

Note that there must be a highest weight state, for otherwise one could construct 
infinitely many eigenstates by repeated application of the raising operators E^". Given 


-67 — 


a highest weight state, let V^ be the vector space spanned by |y) and states obtained by 
acting with all possible products of lowering operators E” on |y). As there are only finitely 
many points on the weight diagram, there can only be finitely many such terms. Then, by 
making use of the commutation relations, it is clear that V' is an invariant subspace of V. 
As the representation is irreducible on V, this implies that V' = V, i.e. V is spanned by 
|) and a finite set of states obtained by acting with lowering operators on |v). Suppose 
that (p,q) is the weight of |Y). Then V is spanned by a basis of eigenstates of Hı and Hə 
with weights confined to the sector given by m < 0 < om relative to (p, q)- all points on the 
weight diagram must therefore lie in this sector. 


Lemma 6. The highest weight state is unique. 


Proof Suppose that |W) and |V/) are two highest weight states with weights (p,q), (p', q’) 
respectively. Then (p', ^) must make an angle m < 0 < zs relative to (p, q) and (p, q) must 
make an angle m < 0 < om relative to (p', q^). This implies that p = p', q = q'. 

Next suppose that |v1) and |v3) are two linearly independent highest weight states 
(both with weight (p,q)). Let Vi; and Vz be the vector spaces spanned by the states 
obtained by acting with all possible products of lowering operators E"' on |v4) and |v») 
respectively; one therefore obtains bases for Vj and V3 consisting of eigenstates of Hı and 
Hə. By the reasoning given previously, as Vi and V2 are invariant subspaces of V and the 
representation is irreducible on V, it must be the case that Vj = V2 = V. In particular, we 
find that |v53) € Vi. However, the only basis element of Vi which has weight (p,q) is |v1), 
hence we must have |w2) = c|v) for some constant c, in contradiction to the assumption 
that |q 1) and |v») are linearly independent. M 

Having established the existence of a unique highest weight state |), we can proceed 
to obtain the generic form for the weight diagram. Recall that the highest weight j of an 
irreducible representation of £(SU(2)) is always non-negative. By acting on |v) with the 
lowering operators E"', one obtains three irreducible representations of C(SU(2)). Non- 
negativity of the highest weight corresponding to the £(SU(2)) irreducible representation 
generated by E! implies that the highest weight must lie in the half-plane to the right of 


the line 0 = 4, or on the line 0 = 5: 


Non-negativity of the highest weight corresponding to the £(SU(2)) irreducible represen- 
tation generated by E? implies that the highest weight must lie in the half-plane above 


the line 0 = @, or on the line 0 = ¢: 
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Finally, non-negativity of the highest weight corresponding to the Z(SU(2)) irreducible 


representation generated by E? implies that the highest weight must lie in the half-plane 


above the line 0 — 9t or on the line 0 — an. 


As the highest weight must lie in all three of these regions, it must lie in the sector 
6 <9< 5 relative to (0,0), or at the origin: 


l 
| 
bd 
au 
| 
M 


Lemma 7. Ifthe highest weight is (0,0), then there is only one state in the representation, 
which is called the singlet. 
Proof 

Let |Y) be the highest weight state with weight (0,0). Suppose that E"'|w) 4 0 for 
some m. Then by the reflection symmetry of the weight diagram, it follows that E^" |y) 4 0, 
in contradiction to the fact that E* |v) = 0 for i = 1,2,3, as |v) is the highest weight 
state. Hence ET |y) = 0 for m = 1,2,3. Also Hi |y) = H2 |p) = 0. It follows that the 


1-dimensional subspace V’ spanned by |v) is an invariant subspace of V, and therefore 
V = V' as the representation is irreducible. M 
There are then three possible locations for the highest weight state |y}. 


4.1.1 Triangular Weight Diagrams (I) 


Suppose that the highest weight lies on the line 6 = 5. In this case, by applying powers of 
E? the states of the £(SU(2)) representation given in (4.8) are generated. These form a 
line orthogonal to the axis of reflection 0 = §, about which they are symmetric, and there 
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are no states outside this line, as these points cannot be reached by applying lowering 
operators. Then, by using the reflection symmetry, it follows that the outermost states 
from an equilateral triangle with horizontal base. Each lattice point inside the triangle 
corresponds to (at least) one state which has this weight, because each lattice point in the 
triangle lies at some possible weight within the £(.SU(2)) representation given in (4.7), 
and from the properties of £(SU(2)) representations, we know that this has a state with 
this weight (ie. as the C(SU(2)) weight diagram has no “holes” in it, neither does the 
L(SU(3)) weight diagram). 
This case is illustrated by 


Proposition 26. Each weight in this triangle corresponds to a unique state. 
Proof 

Note that all of the states on the right edge of the triangle correspond to unique states, 
because these weights correspond to states which can only be obtained by acting on |q) 
with powers of E?. It therefore follows by the reflection symmetry that all of the states on 
the edges of the triangle have multiplicity one. 

Now note the commutation relation 


[EL,E?] = -—— E? (4.11) 


v2 ~ 

This implies that products of lowering operators involving E? can be rewritten as 
linear combinations of products of operators involving only E! and E? (in some order). 
In particular, we find 


(El)(E2)"|w) = [El, E?](E? "^! |y) + E? EL (E2)! y) 
= -E (E2 Iu) + E2 E! (E2)! |y) 
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= ARCH) (412) 


by simple induction, where we have used the fact that E! |) = 0 and [E? , E?] = 0. 


A generic state of some fixed weight in the representation can be written as a linear 
combination of products of E? and E! lowering operators acting on |Y) of the form 


T(E}, E?) |y) (4.13) 


where II(E! , E?) contains m powers of E? and l powers of E! where m, £ are uniquely 
determined by the weight of the state- only the order of the operators is unfixed. 


Using (4.12), commute the E! states in this product to the right as far as they will 
go. Then either one finds that the state vanishes (due to an E! acting directly on |2)), or 
one can eliminate all of the E! terms and is left with a term proportional to 


(E yn-* (gy p) (4.14) 


where we have used the commutation relations [E?, E?] = [E! , E?] = 0. 


Hence, it follows that all weights in the diagram can have at most multiplicity 1. 
However, from the property of the Z(SU(2)) representations, as the weights in the outer 
layers have multiplicity 1, it follows that all weights in the interior have multiplicity at 
least 1. 


Hence, all the weights must be multiplicity 1. M 


4.1.2 Triangular Weight Diagrams (II) 


Suppose that the highest weight lies on the line 0 = @. In this case, by applying powers of 
E the states of the £(SU(2)) representation given in (4.7) are generated. These form a 
horizontal line orthogonal to the axis of reflection 0 — 5, about which they are symmetric, 
and there are no states outside this line, as these points cannot be reached by applying 
lowering operators. Then, by using the reflection symmetry, it follows that the outermost 
states from an inverted equilateral triangle with horizontal upper edge. Each lattice point 
inside the triangle corresponds to (at least) one state which has this weight, because each 
lattice point in the triangle lies at some possible weight within the £(SU(2)) representation 
given in (4.7), and from the properties of C(SU(2)) representations, we know that this has 
a state with this weight (i.e. as the C(SU(2)) weight diagram has no “holes” in it, neither 
does the £(SU(3)) weight diagram). 


'This case is illustrated by 


ep 


Proposition 27. Each weight in this triangle corresponds to a unique state. 
Proof 

Note that all of the states on the horizontal top edge of the triangle correspond to 
unique states, because these weights correspond to states which can only be obtained by 
acting on |v) with powers of El. It therefore follows by the reflection symmetry that all 
of the states on the edges of the triangle have multiplicity one. 

Now, using (4.11) it is straightforward to show that 
n 


E?(E*)" |4) = T 


E? (ELY yy (4.15) 
for n > 1, where we have used E? |y) = 0. 
Now consider a state of some fixed weight in the representation; this can be written as 


a linear combination of terms of the form 
I(E}, E?) |y) (4.16) 


where II(E! , E?) contains m powers of E! and £ powers of E? in an appropriate order, 
where m and £ are determined uniquely by the weight of the state in question. 

Using (4.15), commute the E? states in this product to the right as far as they will 
go. Then either one finds that the state vanishes (due to an E? acting directly on |v)), or 
one can eliminate all of the E! terms and is left with a term proportional to 


(EL) (EÈ) py) (4.17) 


where we have used the commutation relations [E? , E] = [El , E?] = 0. 


Hence, it follows that all weights in the diagram can have at most multiplicity 1. 
However, from the property of the L(SU(2)) representations, as the weights in the outer 
layers have multiplicity 1, it follows that all weights in the interior have multiplicity at 
least 1. 

Hence, all the weights must be multiplicity 1. M 
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4.1.3 Hexagonal Weight Diagrams 


Suppose that the highest weight lies in the sector § < 0 < 5. In this case, by applying 
powers of E! the states of the £(SU(2)) representation given in (4.7) are generated. These 
form a horizontal line extending to the left of the maximal weight which is orthogonal 
to the line 0 — 5, 
points cannot be reached by applying lowering operators. Also, by applying powers of E? 


about which they are symmetric, There are no states above, as these 


the states of the £(SU(2)) representation given in (4.8) are generated. These form a line 
extending to the right of the maximal weight which is orthogonal to the axis of reflection 
0e, 
as these points cannot be reached by applying lowering operators. 


about which they are symmetric, and there are no states to the right of this line, 


Then, by using the reflection symmetry, it follows that the outermost states form a 
hexagon. Each lattice point inside the hexagon corresponds to (at least) one state which 
has this weight, because each lattice point in the hexagon lies at some possible weight 
within the £(SU(2)) representation given in (4.7), and from the properties of C(SU(2)) 
representations, we know that this has a state with this weight (i.e. as the C(SU(2)) weight 
diagram has no “holes” in it, neither does the £(SU(3)) weight diagram). 

'This case is illustrated by 


The multiplicities of the states for these weight diagrams are more complicated than 
for the triangular diagrams. In particular, the weights on the two edges of the hexagon 
leading off from the highest weight have multiplicity 1, because these states can only be 
constructed as (E! )" |Y) or (E?) |). So by symmetry, all of the states on the outer layer 
of the hexagon have multiplicity 1. However, if one proceeds to the next layer, then the 
multiplicity of all the states increases by 1. This happens until the first triangular layer is 
reached, at which point all following layers have the same multiplicity as the first triangular 
layer. 
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Suppose that the top horizontal edge leading off the maximal weight is of length m, 
and that the other outer edge is of length n, with m m. This situation is illustrated 
below 


The highest weight is then at (5 (m 4- 2n)). The outer n layers are hexagonal, 


2°? DNE 
whereas the n 4- 1-th layer is triangular, and all following layers are also triangular. As 


one goes inwards through the outer n 4- 1 layers the multiplicity of the states in the layers 
increases from 1 in the first outer layer to n + 1 in the n + 1-th layer. Then all the states 
in the following triangular layers have multiplicity n 4- 1 as well. 

We will prove this in several steps. 


Proposition 28. States with weights on the k-th hexagonal layer for k =1,...,n or the 
k — n 4- 1-th layer (the first triangular layer) have multiplicity not exceeding k. 
Proof 
In order to prove this, consider first a state on the upper horizontal edge of the k-th 
layer for k < n+ 1. The length of this edge is m — k + 1. A general state on this edge is 
obtained via 
II(E? , EL) |y) (4.18) 


where II(E? , E!) contains (in some order) k — 1 powers of E? and £ powers of E! for 
€=k-—1,...,m 

Now use the commutation relation (4.11) to commute the powers of E? to the right as 
far as they will go. Then the state can be written as a linear combination of the k vectors 


|vi) = (BE) t (gl yr n qe ye y) (4.19) 


for i = 1,...,k. 
It follows that this state has multiplicity < k. 
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Next consider a state again on the k-th level, but now on the edge leading off to the 
right of the horizontal edge which we considered above; this edge is parallel to the outer 
edge of length n. Take k < n+ 1, so the edge has length n — k + 1. A state on this edge is 
obtained via 

TI(E2, E?) v) (4.20) 


where ÍII(E! , E?) contains (in some order) k — 1 powers of E! and £ powers of E? where 
(— k — 1,...,n. Now use the commutation relation (4.11) to commute the powers of E! 
to the right as far as they will go. Then the state can be written as a linear combination 
of the k vectors 


Jwi) = (ES t qe y (et y" |y) (4.21) 


for i = 1,..., k. 

So these states also have multiplicity < k. By using the reflection symmetry, it follows 
that the all the states on the k-th hexagonal layer have multiplicity k. MI 

We also have the 


Proposition 29. The states with weights in the triangular layers have multiplicity not 
exceeding n+ 1. 
Proof 

Consider a state on the k-th row of the weight diagram for m +1 > k > n+ 1 which 
lies inside the triangular layers. Such a state can also be written as 


I(E? , EL) y) (4.22) 


where II(E? , E!) contains (in some order) k — 1 powers of E? and £ powers of E! for 
L = k—1,...,m. and hence by the reasoning above, it can be rewritten as a linear 
combination of the k vectors |v;) in (4.19), however for i < k—n, |v;) = 0 as (E2)*-* |y) = 0. 
The only possible non-vanishing vectors are the n + 1 vectors |Uk-n} , |uk naa) ;. --  |vi)- 
Hence these states have multiplicity < n+ 1. M 

Next note the lemma 


Lemma 8. Define |wi y) = (E? y 1(ELy)*(E2)*75 hy) fori 5 1,..., k, k=1,...,n +1. 
Then the sets Sy = {|wi,k),---,|Wk,n)} are linearly independent fork =1,...,n+1. 
Proof 


By using the commutation relations, it is straightforward to prove the identities 


. i 
ET uy D(——4- =p + 5+1- k) lwi-1,-1) 


i dr 
(k — i) (—q— zp- + 5 5 Pi k-1) 


(eu — 1)) pus gx) (4.23) 


(with obvious simplifications in the cases when i = 1 or i = k) 
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Note that Sı = (|v)) is linearly independent. Suppose that Sj. is linearly indepen- 
dent (k > 2). Consider Sk. Suppose 


k 
S ci |Win) 20 (4.24) 
i=l 


for some constants cj. Applying E? to (4.24) and using the linear independence of 
S,..1 we find the relation 


i 3 
(2 4 k)ci 
i( 2 zP + 5 + 5 )Ci+1 


for i= 1,...,k— 1. Applying E? to (4.24) another recursion relation is obtained 


1 jo add 
pei tlk yw pica k)e; = 0 (4.26) 


V2 


Combining these relations we find c;41 = 0 fori = 1,..., k— 1. If v3q ipti | 1 lk 0 
when i = 1 then one also has c; = 0. This holds if k < n + 1, however if k = n + 2 then c 


is not fixed by these equations. The induction stops at this point. MI 


These results are sufficient to fix the multiplicity of all the states. This is because the 


vectors in Sy for 1 < k < k +1 correspond to states with weight (p,q) — (k — 1)(3, v3) 


which are at the top right hand corner of the k-th hexagonal (or outermost triangular for 


k = n+ 1) layer. We have shown therefore that these weights have multiplicity both less 
than or equal to, and greater than or equal to k. Hence these weights have multiplicity 
k. Next consider the states on the level k edges which are obtained by acting with the 
L(SU(2)) lowering operators E! and E? on the “corner weight” states. Observe the 
following: 


Lemma 9. Let d be a representation of C(SU(2)) on V be such that a particular C(SU(2)) 
weight m > 0 has multiplicity p. Then all weights m' such that |m/| < m have multiplicity 
>p 

whose proof is left as an exercise. 

By this lemma, all the states on the k-th layer obtained in this fashion have multi- 
plicity k also. Then the reflection symmetry implies that all states on the k-th layer have 
multiplicity k. 

In particular, the states on the outer triangular layer have multiplicity n + 1. We 
have shown that the states on the triangular layers must have multiplicity not greater than 
n -- 1, but by the lemma above together with the reflection symmetry, they must also have 
multiplicity > n + 1. Hence the triangular layer weights have multiplicity n + 1, and the 
proof is complete. 

This was rather long-winded. There exist general formulae constraining multiplicities 
of weights in more general Lie group representations, but we will not discuss these here. 
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4.1.4 Dimension of Irreducible Representations 


Using the multiplicity properties of the weight diagram, it is possible to compute the 
dimension of the representation. We consider first the hexagonal weight diagram for m > n. 

Then there are 1+---+(m—n)+(m—n+1) = $(m—n+1)(m—n-+ 2) weights in the 
interior triangle. Each of these weights has multiplicity (n + 1) which gives $(n + 1)(m — 
n + 1)(m — n + 2) linearly independent states corresponding to weights in the triangle. 
Consider next the k-th hexagonal layer for k = 1,...,n. This has 3((m 4- 1 — (k — 1)) + 
(n 4- 1— (k — 1)) 2 2) 2 3(m +n + 2 — 2k) weights in it, and each weight has multiplicity 


k, which gives 3k(m +n + 2 — 2k) linearly independent states in the k-th hexagonal layer. 


'The total number of linearly independent states is then given by 
1 i 1 
5 (t1) (m-n- 1)(m—n4-2)4-M ^ 3k(m-n42-2k) = 5(m+1)(n+1)(m+n+2) (4.27) 
k=1 
This formula also applies in the case for m < n and also for the triangular weight diagrams 
by taking m = 0 or n = 0. The lowest dimensional representations are therefore 1,3,6,8,10... 


4.1.5 The Complex Conjugate Representation 


Definition 43. Let d be a representation of a Lie algebra L(G) acting on V. If v € L(G), 
then viewing d(v) as a matriz acting on V, the complex representation d is defined by 


d(v)u — (d(v))*u (4.28) 


foru € V, where x denotes matrix complex conjugation. 
Note that as d(v) is linear in v over R, it follows that (d(v))* is also linear in v over 
IR. Also, as 
d([v, w]) = d(v)d(w) — d(w)d(v) (4.29) 


for v,w € L(G), so taking the complex conjugate of both sides we find 


d([v, w]) = d(v)d(w) — d(w)d(v) (4.30) 


ie. d is indeed a Lie algebra representation. 
Suppose that Tą are the generators of L(G) with structure constants ca^. Then as d 
is à representation, 


[d(14), d(T) = Cap A(Te) (4.31) 


Taking the complex conjugate, and recalling that Cab? are real, we find 


[d(5), d(15)] = cav°d (Te) (4.32) 


ie. the d(T,) and d(T;) satisfy the same commutation relations. 

In the context of representations of C(SU(3)), the conjugate operators to (H1, iH», 
(EP? -- E") and E% — E" are denoted by iHi, iH», i(E"' + E), and E?" — E" respectively 
and are given by 


iH, = (iHi)* 


cT 


ily (Hay 
iE? + ET) = (EP + ET) 
E? Em = (gm — gmy (4.33) 


which implies 


H,--Un). Ho--ünm), EP =—-(EZ)* (4.34) 


Then Hi, Hz and ET satisfy the same commutation relations as the unbarred opera- 


tors, and also behave in the same way under the hermitian conjugate. One can therefore 
plot the weight diagram associated with the conjugate representation d, the weights being 
the (real) eigenvalues of Hı and Hj. But as Hı = —(Hi)* and Hy = —(H3)* it follows 
that if (p,q) is a weight of the representation d, then (—p, —q) is a weight of the represen- 
tation d. So the weight diagram of d is obtained from that of d by inverting all the points 
(p,q) —^ —(p,q). Note that this means that the equilateral triangular weight diagrams A 
and V of equal length sides are conjugate to each other. 


4.2 Some Low-Dimensional Irreducible Representations of Z(SU(3)) 


4.2.1 The Singlet 


The simplest representation has only one state, which is the highest weight state with 
weight (0,0). This representation is denoted 1. 


4.2.2 3-dimensional Representations 


Take the fundamental representation. Then as hı and hg are already diagonalized, it is 
straightforward to compute the eigenstates and weights. 
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State Weight 


1 
The state of highest weight is 1 which has weight (4, ——). The weight diagram is 
0 
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This representation is denoted 3. It will be convenient to define the following states 
in the 3 representation. 


1 
u=|0], d= 
0 


or © 


0 
j s=]|0 (4.35) 
1 


so that u has weight (5, A) d has weight (—3, 33) and s has weight (0, =a) 


The lowering operators have the following effect: d = vV2el u, s = V2e2u and s = 
v2e? d. 

The complex conjugate of this representation is called 3 and the weights are obtained 
by multiplying the weights of the 3 representation by —1. 
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The state of highest weight is which has weight (0, 5) The weight diagram is 


g 


(4.36) 


so that ü has weight (— ba d has weight (5 (73) and s has ipd (0, z5). 
The lowering operators have the following effect: u = = —2e8 s ,d= —\/2e2 s and 
= —4/2el d; where ef = —(e™)*. 
Exercise: Verify that all other lowering operators é™ (except those given above) an- 
nihilate ü, d, 5. Also compute the effect of the raising operators gh. 
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4.2.3 Eight-Dimensional Representations 


Consider the adjoint representation defined on the complexified Lie algebra £(SU (3)), i.e. 
ad(v)w = [v,w]. Then the weights of the states can be computed by evaluating the 
commutators with hı and ho: 


State v | [h1, v] | [he, v] Weight 
hi 0 (0,0) 
ha 0 0 (0,0) 
el el 0 (1,0) 
el —el 0 (—1,0) 
a (ja pxe CL 
e? ie? -Ve (1,- V8) 
a pipa d 
e [ai [Fe Cho») 


The highest weight state is e? with weight (3, m All weights have multiplicity 1 


except for (0,0) which has multiplicity 2. The weight diagram is a regular hexagon: 


4.3 Tensor Product Representations 


Suppose that dı, d2 are irreducible representations of Z(SU(3)) acting on Vi, V2 respec- 
tively. Then let V = V; 69 V2 and d = dı Q 1-- 1G də be the tensor product representation 
of C(SU(3)) on V. In general d is not irreducible on V, so we want to decompose V into 
a direct sum of invariant subspaces on which the restriction of d is irreducible. 

To do this, recall that one can choose a basis of V4 which consists entirely of eigenstates 
of both di (hi) and dı(h2). Similarly, one can also choose a basis of V2 which consists entirely 
of eigenstates of both d2(h1) and da(h3). Then the tensor product of the basis eigenstates 
produces a basis of Vi &9 V2 which consists of eigenstates of d(h1) and d(h2). 

Explicitly, suppose that |¢;) € V; is an eigenstate of d;(hi) and d;(h2) with weight 
(pi qi) (ie. di(hi)|$;) = pildi) and d;(h2)|é$;) = qi|d;)) for à = 1,2. Define |d) = 


BI = 


[61) & |ó2). Then 


d(hi) |e) = (dı (hı) |b1)) & |2) + 161) & (da(h1) |2)) 
= (m |é1)) & |2) + |41) & (p2 |2)) 
= (pı + pa) |) (4.37) 


and similarly 


d(ha) |$) = (qı + a2) |9) (4.38) 


So the weight of |) is (pı + po, qı + q2); the weights add in the tensor product repre- 
sentation. 

Using this, one can plot a weight diagram consisting of the weights of all the eigenstates 
in the tensor product basis of V, the points in the weight diagram are obtained by adding 
the pairs of weights from the weight diagrams of dı and d» respectively, keeping careful 
track of the multiplicities (as the same point in the tensor product weight diagram may be 
obtained from adding weights from different states in Vi ®) V2.) 

Once the tensor product weight diagram is constructed, pick a highest weight, which 
corresponds to a state which is annihilated by the tensor product operators E^" for m = 
1,2,3. (Note that as the representation is finite-dimensional such a state is guaranteed 
to exist, though as the representation is no longer irreducible, it need not be unique). If 
there are multiple highest weight states corresponding to the same highest weight, one 
can without loss of generality take them to be mutually orthogonal. Picking one of these, 
generate further states by acting on a highest weight state with all possible combinations 
of lowering operators. The span of these (finite number) of states produces an invariant 
subspace W, of V on which d is irreducible. Remove these weights from the tensor product 
weight diagram. If the multiplicity of one of the weights in the original tensor product 
diagram is k, and the multiplicity of the same weight in the W; weight diagram is k’ then 
on removing the W, weights, the multiplicity of that weight must be reduced from k to 
k — k'. 

Repeat this process until there are no more weights left. This produces a decomposition 
V = W: Q... Wk of V into invariant subspaces W; on which d is irreducible. 

Note that one could also perform this process on triple (and higher order) tensor 
products e.g. Vi Q V5 Q V3. In this case, one would construct the tensor product weight 
diagram by adding triplets of weights from the weight diagrams of dı on Vi, dz on Vz and 
d3 on V3 respectively. 

'This process can be done entirely using the weight diagrams, because we have shown 
that for irreducible representations, the location of the highest weight fixes uniquely the 
shape of the weight diagram and the multiplicities of its states. 

We will see how this works for some simple examples: 


4.3.1 306 3 decomposition. 


Consider the 3 & 3 tensor product. Adding the weights together one obtains the following 
table of quark content and associated weights 
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Quark content and weights for 3 @ 3 
Quark Content Weight 
1 

d@d (-1,3) 
ses (0, — 5.) 
ugd, d&u (0, Js) 

Ugs, SQU (3:7 a3) 
1 1 

d&s,s&d C-A 


The raising and lowering operators are EY = e @1+1@e"% 
is u & u with weight (1 


1 
(vg) 


. The highest weight state 


Applying lowering operators to u & u it is clear that a 


six-dimensional irreducible representation is obtained. The (unit-normalized) states and 


weights are given by 


States and weights for the 6 in 3 3 


State Weight 
1 
2 
sas (0, — 3) 
Jg (d 8 uc u& d) (0, Js) 
Jg(u8 st seu) (5-zx) 
Jg (d& sc sad) (-5$.— 31) 
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which has the following weight diagram 


This representation is called 6. Removing the (non-vanishing) span of these states 
from the tensor product space, one is left with a 3-dimensional vector space. The new 
highest weight is at (0, Fa with corresponding state za Q u — u Q d) (this is the unique 
linear combination- up to overall scale- of d & u and u & d which is annihilated by all the 
raising operators). This generates a 3. The states and their weights are 


States and weights for the 3 in 3 & 3 
State Weight 

zdgu-ugd) (0, 75) 

Jg (d& s—58 d) Cia) 

yis@u-ues)| (D-7) 


Hence 3®3=603. 


The states in the 6 are symmetric, whereas those in the 3 are 
antisymmetric. 


4.3.2 3@3 decomposition 


For this tensor product the quark content /weight table is as follows: 
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Quark content and weights for 3 @ 3 
Quark Content Weight 
ues (5.2) 
ugd (1,0) 
d&s DEN E 
u®ut,d@d,s@s (0,0) 
d&u (—1,0) 
s&ā (en) 
sod =>) 


with weight diagram 


The raising and lowering operators are EY = e & 1+ 1 eT All weights have 


multiplicity 1, except for (0,0) which has multiplicity 3. The highest weight state is u & 5 
with weight (5, SS). Acting on this state with all possible lowering operators one obtains 
an 8 with the following states and weights 


States and weights for the 8 in 3 8 3 
State Weight 
ugs (4, X3) 
u®d (1,0) 
d&s (ub 
Jm (d&d—-ucu)-(d&dtu&u-2scs) (0, 0) 
dou (=1,0) 
seü ied. 
sed (3.33) 
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Removing these weights from the weight diagram, one is left with a singlet 1 with 
weight (0,0), corresponding to the state 


1 J 
—z(u& ü--ses--da&d) (4.39) 


V3 


which is the unique linear combination- up to an overall scale- of u& ü, s&5 and d&d which 
is annihilated by the raising operators E?*. Hence we have the decomposition 393 = 891. 


4.3.3 30 3 8&3 decomposition. 


For this tensor product the quark content/weight table is as follows: 


Quark content and weights for 38383 


Quark Content Weight 
uSu@u (3, X3) 

sess (0, —4/3) 
d@de@d ae 
UBDUAS, UQsQu,sQu@Bu (1,0) 
u@u@d,u@d@u,d@ue@u| (4,¥) 


SQSQOU, SQUOS, UQs@s (1,93) 
seoscd,sedes deseas (-1,-X3) 
d®d®s,d®s@d,s@ded (—1,0) 
d@d@u,d@u@duaded| (-4,%8) 


u@®d@s,us@d, d@u@s, 
d®s@u, s@u@d,s@d@u (0,0) 


with weight diagram 


The raising and lowering operators are E? = ef 181-c-1,8epol-l1G1ceT. 
There are six weights of multiplicity 3, and the weight (0,0) has multiplicity 6. The highest 
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weight is u & u & u with weight (3, X3), By applying lowering operators to this state, one 
obtains a triangular 10-dimensional irreducible representation denoted by 10, which has 
normalized states and weights: 


States and weights for 10 in 38383 
State Weight 
u@u@u Ga) 
S@s@s (0, —/3) 
d&d&d (23, 99) 
FUSuSst+uS@sQut+sQuGu) (1,0) 
uSu@dtu@d@u+d@u@uw) gx» 
Fs @s@utsQust+u@s@s) C9) 
q(s@s@d+s@d@st+dBs@s) (-i,- X3) 
JA (d8destdasedeseded) | (-10) 
J(dedeucdeugdrueded)| (-4. $) 
FluS®d@s+uSs@d+dQu® s+ 
d@s@uts@uOd+s@d@u) (0, 0) 


The 10 weight diagram is 


Removing the (non-vanishing) span of these states from the tensor product space, one 
is left with a 17-dimensional vector space. The new weight diagram is 
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Note that the highest weight is now (1 v3) 


5,7): This weight has has multiplicity 2. It 
should be noted that the subspace consisting of linear combinations of d®u@u, u®d@u 
and u & u & d which is annihilated by all raising operators E^" 


™ is two-dimensional and 
is spanned by the two orthogonal states og (d Qu@utu@S@d@u— 2u@u® d) and 


zld QuS@u-—uSd@u). By acting on these two states with all possible lowering 
operators, one obtains two 8 representations whose states are mutually orthogonal. 


The states and weights of these two 8 representations are summarized below: 


States and weights for an 8 in 389383 


State Weight 
(d@u®@utuSd@u— 2u@u@d) (1, Y3) 


(s&u&u-c-u&s&u-—2uGQuG s) (1 
g;Qdedeu-d&ued-uceded) | (-$ 
(s@d@uts@uadtdasau 


+u@s@d—2d®@u@s—2u@d@s), 
74,05 8d u-c24d89 sGu-sQu&gd 


—-d&àu&s—u&se&d-—uc&das) (0, 0) 
-(s@d@d+d@s@d-22Med@s) | (-1,0) 
q(2s@s@u—s@u@s—uSs@s) ( 
7(28@s@d—-s@d®s—d@s®s) (1,293) 


«LS -Bi- A 


Sl- Sl- $l- 
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States and weights for another 8 in 89936 3 


State Weight 
l(deusu-ucdeu) ($5.38) 
A (s8uBu-uGsQu) (1,0) 
4(d@u@d—ue@ded) (-3,8) 


5(s@d@ut+s@usSd—ds@u—ueSs@d), 
i(seu&dc-d&u&s—-u&s&d-—uc&das) (0,0) 


Jí(seded-desed) (—1,0) 
vg(s8u&Bs-ugsas) QE 
J(5edes-de&sas) (t=) 


Removing these weights from the weight diagram, we are left with a singlet 1 with 
weight (0,0). The state corresponding to this singlet is 


1 
—(s@d®@u—s®uSd+dOuSs—dBsGut+uSs@d—uSds) (4.40) 


v6 


which is the only linear combination-up to overall scale- of u®d@s, u& sc d,d&uG&s, 
d®s@u,s®u@dand s&dG u which is annihilated by all the raising operators. 


Hence we have the decomposition 89 39 3— 1090 8908 01 where the states in 10 
are symmetric, but the state in 1 is antisymmetric. The 8 states have mixed symmetry. 


4.4 The Quark Model 


It is possible to arrange the baryons and the mesons into SU(3) multiplets; i.e. the states 
lie in Hilbert spaces which are tensor products of vector spaces equipped with irreducible 
representations of C(SU(3)). To see examples of this, it is convenient to group hadrons into 
multiplets with the same baryon number and spin. We plot the hypercharge Y = S + B 
where S is the strangeness and B is the baryon number against the isospin eigenvalue /3 
for these particles. 


4.4.1 Meson Multiplets 


The pseudoscalar meson octet has B = 0 and J = 0. The (75, Y) diagram is 
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There is also a J = 0 meson singlet 1’. 


The vector meson octet has B = 0 and J = 1. The (J3, Y) diagram is 


There is also a J = 1 meson singlet, ¢. 


4.4.2 Baryon Multiplets 


The baryon decuplet has B — 1 and J — 3 with (I3, Y) diagram 


— 00 — 


There is also an antibaryon decuplet with (I3, Y) — —(1I3, Y). 
The baryon octet has B = 1, J = i with (I3, Y) diagram 


and there is also a J — I baryon singlet A9*. 


4.4.3 Quarks: Flavour and Colour 


On making the identification (p, q) = (Js, vay the points on the meson and baryon octets 
and the baryon decuplet can be matched to points on the weight diagrams of the 8 and 10 
of £(SU(3)). 

Motivated by this, it is consistent to consider the (light) meson states as lying within 
a 3 Q3; as 3@3 = 8 9 1, the meson octets are taken to correspond to the 8 states, and 
the meson singlets correspond to the singlet 1 states. The light baryon states lie within a 
3 ® 3 & 3; the baryon decuplet corresponds to the 10 in 383 8&3 =10 98898 1; the 
baryon octet corresponds to appropriate linear combinations of elements in the 8 irreps, 
and the baryon singlet corresponds to the 1. 
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In this model, the fundamental states in the 3 are quarks, with basis states u (up), 
d (down) and s (strange). The basis labels u,d, s are referred to as the flavours of the 
quarks. The 3 states are called antiquarks with basis u, d, 5. Baryons are composed of 
bound states of three quarks qqq, mesons are composed of bound states of pairs of quarks 
and antiquarks qq. The quarks have J — I and B = i whereas the antiquarks have J — i 


and B — -i which is consistent with the values of B and J for the baryons and mesons. 
(13, Y) plane: 


We have shown that mesons and baryons can be constructed from qq and qqq states 
respectively. But why do qq particles not exist? This problem is resolved using the notion 
of colour. Consider the A** particle in the baryon decuplet. This is a u& u & u state with 
Ju 3. 'The members of the decuplet are the spin 3 baryons of lowest mass, so we assume 
that the quarks have vanishing orbital angular momentum. Then the spin J = 3 is obtained 
by having all the quarks in the spin up state, i.e. u T &u T1 Qu T. However, this violates the 
Pauli exclusion principle. To get round this problem, it is conjectured that quarks possess 
additional labels other than flavour. In particular, quarks have additional charges called 
colour charges- there are three colour basis states associated with quarks called r (red), g 
(green) and b (blue). The quark state wave-functions contain colour factors which lie in 
a 3 representation of SU(3) which describes their colour; the colour of antiquark states 
corresponds to a 3 representation of SU(3) (colour). This colour SU(3) is independent of 
the flavour SU(3). 

These colour charges are also required to remove certain discrepancies (of powers of 
3) between experimentally observed processes such as the decay 7° — 2y and the cross 
section ratio between the processes ete” — hadrons and ete~ — wtp and theoretical 
predictions. However, although colour plays an important role in these processes, it seems 
that one cannot measure colour directly experimentally- all known mesons and baryons 
are SU(3) colour singlets (so colour is confined). This principle excludes the possibility 
of having qq particles, as there is no singlet state in the SU(3) (colour) tensor product 
decomposition 3 & 3, though there is in 3 Q 3 & 3 and 3 Q 3. Other products of 3 and 3 
can also be ruled out in this fashion. 

Nevertheless, the decomposition of 3 & 3 is useful because it is known that in addition 
to the u, d and s quark states, there are also c (charmed), t (top) and b (bottom) quark 
flavours. However, the c, t and b quarks are heavier than the u, d and s quarks, and are 
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unstable- they decay into the lighter quarks. The SU(3) symmetry cannot be meaningfully 
extended to a naive SU(6) symmetry because of the large mass differences which break the 
symmetry. In this context, meson states formed from a heavy antiquark and a light quark 
can only be reliably put into 3 multiplets, whereas baryons made from one heavy and two 
light quarks lie in 3 ® 3 = 6 $ 3 multiplets. 
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5. Spacetime Symmetry 


In this section we examine spacetime symmetry in the absence of gravity. Spacetime 
is taken to be 4-dimensional Minkowski space, M^ , with real co-ordinates z^ for u = 
0,1,2,3, equipped with the Minkowski metric which has the non-vanishing components 


Noo "i N22 733 = 1, or as a matrix 
1 0 0 0 
0 —1 0 0 
= (n) = 5.1 
(w = (n) e e T Gi 
0 0 0 —1 


5.1 The Lorentz Group 


A Lorentz transformation is a linear transformation A : M^ — M* which transforms 
co-ordinates 
pP c (5.2) 


for A", € R, but leaves the length invariant 
mpg ee ate (5.3) 


for all x. 


This condition can be rewritten in matrix notation as 
ATA =n (5.4) 


Suppose that A1, Ag are two 4 x 4 matrices satisfying (5.4) then it is straightforward to 
see that A1A» satisfies (5.4). Also, if A satisfies (5.4) then det A = +1, hence A is invertible, 
with inverse A7! = g- 1 ATm, and 


(AT)? nA = nAn nay ATN 
= nAn AT 
= AAT! 
=n (5.5) 


so AT! is also a Lorentz transformation. Hence, the set of Lorentz transformations forms 
a group, under matrix multiplication. 


Write a generic Lorentz transformation as 


jl 5) (5.6) 


where Ac R, a, € R? and R is a 3 x 3 real matrix 
Then the constraint (5.4) is equivalent to 


and 


A8 = Ra (5.8) 
and 
RR — 6B" = s (5.9) 
Note that (5.7) implies that 
A=+t/ltaa (5.10) 


so in particular A < —1 or A > +1. Then (5.8) fixes 8 in terms of a and R by 


B= X————Ha (5.11) 


1 
T T: T 
anm =] 5.12 
RTR ma eR 3 (5.12) 
Define 1 
R-( T\R 5.13 

( JIvaal+/I+aa) ) SM 

or equivalently 
1 ‘ 
R= (1+ aa) Ê (5.14) 


Then (5.12) implies 
ÊT Ê = l; (5.15) 


ie. R € O(3). Moreover, it is straightforward to check directly that 


det R = ————— det R (5.16) 


where we have used the formula det(I3 + Kaa’) = 1+ Ka.a for any K. Also, one can 


à 0 1 Aat 1 0 
A- P 5.17 
| a ’) | 0 nac | 0 J VO 


O(3) has two connected components, the connected component of I3 (which is SO(3)) 


write 


and hence À 
det A = ———— det R = 
l+taa 


whose elements have determinant det R = +1, and the connected component of —I3, whose 
elements have determinant det R = —1. 

There are therefore four connected components of the Lorentz group, according as 
A99 < —1 or A°9 > 1 and det A = +1 or det A = —1. It is not possible to construct a 
smooth curve in the Lorentz group passing from one of these components to the other. 

The set of Lorentz transformations with det A = +1 forms a subgroup of the Lorentz 


group. 
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Note that (5.13) implies that 


^ 1 
and hence 1 
B.B = Trane RKa = Q.Q (5.20) 


So, if A and A’ are two Lorentz transformations 


B À Br Ys X B 
(2 2) (4 2) T 


with A > 1 and X > 1, then 


(AA) o = AM + b.d > /1+ a.aV/1+ a.a — yaaa 2 1 (5.22) 


Hence the set of Lorentz transformations with A°9 > 1 also forms a subgroup of the 


Lorentz group. 

The subgroup of Lorentz transformations with det A = +1 and A? > 1 is called the 
proper orthochronous Lorentz group, which we denote by SO(3, 1)!. 

We note the useful lemma 


Lemma 10. Suppose that A € SO(3,1)!. Then there exist S1, S2 € SO(3) and z € R such 
that 


cosh z sinh z 
1 oT sinh z cosh z 
A= 
0 c 0 0 
0 0 


oreo 


0 
0 1 0f 

= 5.23 
(2 £) e 
1 


Proof 
From the analysis of the Lorentz group so far, we have shown that if A € SO(3,1)! 
then there exists a € R? and R € SO(3) such that 


Vl+a.a oTR 
A= 1 1 TYR 
a ( Ba Rey sere ee! ) 
— v 1-o.« aT 1 oT (5.24) 
E a pais z a22 0 Ê l 
There also exists $1 € SO(3) and z € R such that 
sinh z 
Q= $1 0 (5.25) 
0 


The result follows on substituting this into A and setting Sy = (51) È. M 
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5.2 The Lorentz Group and SL(2,C) 


Consider the spacetime co-ordinates z^". Define the matrices 


«3e ege ses 
0 1 1 0 i 0 0 =i 


Then given real spacetime co-ordinates x", define the 2 x 2 complex hermitian matrix 


0 3 1, ,,2 
m =A =% BA 
seat = ( i 5 a (5.27) 


Observe that any hermitian 2 x 2 matrix can be written as z for some real x". 
Note that 


det $ = (a°)? — (1)? — (x2)? — (23)? = nua" z" (5.28) 


Nuva” is invariant under the action of SO(3,1). det % is invariant under the action of 
SL(2, C), the complex 2 x 2 matrices with unit determinant. 


Proposition 30. There exists an isomorphism m : SL(2, C)/Zo + SO(3,1)! where SL(2, C)/Z» 
consists of elements +N € SL(2, C) with +N identified with —N. 
Proof 

Given N € SL(2,C) consider the 2 x 2 complex matrix NZN!. The components of 
this matrix are linear in the spacetime co-ordinates zr". As i is hermitian, it follows that 


NZN' is also hermitian. Hence there exist A“, € R (independent of x) for u,v = 0,...,3 
such that ru 
NàN! = (Az) (5.29) 


~ 


Taking the determinant of both sides we find det  — det (Az) for all x, and therefore A is 
a Lorentz transformation. 
Set 
A-—mn(N) (5.30) 


Note that 


Tr (Ax) = 2A°,,0! = Tr (NT N3) (5.31) 


Setting x° = 1, z! = x? = x? = 0 we find A95 = Tr (NİN) » 0. 
If Ni, No € SL(2, C) then 


(Ny No)@(NiNo)t = (n (Ni Na)a) (5.32) 
But 
(N1N3)&(NLN3)! = Ni(N3ENI)N] 


= Ni(x(Ns)z)N] 


——— 


= qn(Ni)n(No)z (5.33) 
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Hence (n (NT Na)a) = a(Ni)a(No)s for all x, which implies «(Ni N2) = m(Nı)r(N2). 
Next we will establish that 7 is onto SO(3,1)!. First recall that any R € SO(3,1)! of 


the form 
1 05 
R= ix 5.34 
E à) (5:34) 


can be written as a product of rotations around the spatial co-ordinate axes 


1 0 0 0 
0 1 0 0 
R = 5.35 
(1) 0 0 COS $1 sin $1 ( ) 
0 0 — sin $1 cos Q1 
1 0 0 0 
0 COS Q2 0 — sin ¢2 
R = 5.36 
2(¢2) 0 0 1 0 (5.36) 
0 sin $» 0 cos $» 
and 
1 0 0 0 
0 COS Q3 sin $3 0 
R = 5.37 
(6s) 0 — Sin $a COS $3 0 ( ) 
0 0 0 1 
ib; ; 
By a direct computation we find n(ez 7 ) = Rj for j = 1,2,3; and 
cosh z sinh z 0 0 
EAE sinh z coshz 0 0 
2a = 5.38 
m(e 27 ) 0 0 i. 45 (5.38) 
0 0 0 1 


where gie € SL(2,C) for j = 1,2,3 and eR” c SL(2, C). 

Hence, if A € SO(3,1)!, it follows that one can write A = A1.A2... Aj where A; are 
elementary rotation or boost transformations in SO(3, 1)!, and from the above reasoning, 
A; = 1(N;) for some N; € SL(2, C). Therefore, A = «(N1.N5... Ng), so 7 is onto SO(3, 1)!. 

Next, suppose that (NV) = 7(M) for N, M € SL(2, C). Then 


NN! = M&MÍ (5.39) 


Set Q = M-!N, so that 
QzQ'-i (5.40) 
Setting x? = 1, zt = z? = 23 = 0, we obtain QQ' = Ib, so Q € SU(2). Hence 


Qa! = 0'Q (5.41) 
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for i = 1,2,3. The only Q € SU(2) satisfying this is Q = tly, so M = +N. Hence 7 is a 
2—1 map. 

Lastly, we must prove that if N € SL(2,C) then 7(N) € SO(3,1)!. We have already 
shown that (NV) is orthochronous. Suppose that det(a(N)) = —1. Consider 


1 0 0 0 

à 0o 1 0 0 

A= N 5.42 
ooo ae bo E (5.42) 
0 0 0 -1 


The det A = +1, so A € SO(3,1)!. Hence, there exists some N’ € SL(2,C) such that 


A=7(N’'), so 


1 0 0 0 
: ; : : n(N) = a(N!) (5.43) 
0 0 0 —1 
Setting Y = N’N7!, we obtain 
1 0 0 0 
: : : ; zs (5.44) 
0 0 0 al 
for some Y € SL(2,C). This implies that 
Ya,otyt = goly + 210! + 290? — 430° (5.45) 


for all z,. In particular, for x? = 1, z! = x? = a? = 0 we find YY! = I5, so Y € SU(2). 
The remaining constraints are 


Yot =0!Y, Yo?— oó?Y, Yo? = —o?Y (5.46) 


This is not possible, because [Y, o1] = [Y,o?] = 0 implies that Y = oll; for some a € C. 
As det Y = 1 this implies Y = Io, but then Yo? 4 —o?Y. Hence if N € SL(2, C) then 
T(N) € SO(3,1)!. 

Although « : SL(2, C) — SO(3,1)! is not 1-1, we have shown that the restriction of 7 
to SL(2, C)/Zo, in which N is identified with —N is 1-1. M 


5.3 The Lie Algebra £(SO(3, 1)) 


'To compute the constraints on the tangent matrices, consider a curve in the Lorentz group 
A(t) with A(0) = I4. This is constrained by 


A(t) A(t) =n (5.47) 
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Differentiate this constraint and set t = 0 to obtain 


m4 qm —0 (5.48) 
where m — (SO Cus. The generic solution to this constraint is 
9. x 
m, = ( E j (5.49) 
for any x € IR? and S is a real 3 x 3 antisymmetric matrix; S = —ST. There are three real 


degrees of freedom in x and three real degrees of freedom in the antisymmetric matrix S. 
Hence the Lie algebra is six-dimensional. 
Define the 4 x 4 matrices M"" for u,v = 0,1,2,3 by 


(M) = i(t g — na) (5.50) 


note that M"" = — M"", so there are only six linearly independent matrices defined here. 
By direct computation, we find 


0 i 0 0 0 0 i 0 
M” = i 0 0 0 M = 0 0 0 0 
0 0 0 0 i 0 0 0 
0 0 0 0 0 0 0 0 
0 0 0 i 0 0 0 0 
M? - 0 0 0 0 M? = 0 0 —i 0 
0 0 0 0 0 i 0 0 
i 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 
M” = 0 0 0 —1 M = 0 0 0 0 (5.51) 
0 0 0 0 0 0 0 —i 
0 i 0 0 0 0 i 0 
and 
[M"", MP7] = i( MnP + MPR — MHP — MH.) (5.52) 
which defines the complexified Lie algebra of the Lorentz group. 
Define 
1 
Ji = gie Mjk 
Ki = Moi (5.53) 
for i, j,k = 1,2,3. Then it follows that 
[Ji Jj] = 1€ijk Ik 
[K;, Kj] = —i€ijk Jk 
[45,5] = iei jk Ky (5.54) 
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50, setting 


1 1 
A4i=5(i- iKi) — Bie 5(Ji iK) (5.55) 


we obtain the commutation relations 


[Ai Aj] = tek Ax 
[Bi, Bj] = iei Bk 
[Ai, Bj] = 0 (5.56) 


Hence the complexified Lorentz algebra £(SO(3,1)) can be written as the direct sum of 
two commuting complexified Z(SU(2)) algebras. It follows that one can classify irreducible 
representations of the Lorentz algebra by spins (A, B) for 24,2B € N. 


5.4 Spinors and Invariant Tensors of SL(2,C) 


Definition 44. The left handed Weyl spinors are elements of a 2-dimensional complex 
vector space V on which the fundamental representation of S L(2, C) acts via D(N)w = Nw 
where N € SL(2,C). In terms of components, if » € V has components pa for a = 1,2 
with respect to some basis of V, then under the action of SL(2, C), w transforms as 


Ya > Wy = Naf vg (5.57) 
where N € SL(2,C). 


Definition 45. The right handed Weyl spinors are elements of a 2-dimensional com- 
plex vector space V on which the complex conjugate of the fundamental representation of 
SL(2,C) acts as D'(N)x = N*x where N € SL(2,C) and N* is the complex conjugate 
of N. In terms of components, if X € V has components Xa for à = 1,2, then under the 
action of SL(2, C), x transforms as 


Xa > Xa = N'aPX5 (5.58) 


where N € SL(2,C). 
Note: One should regard a and à as being entirely independent! The components of 
these spinors anticommute. 


We also define c^? and Eg to be totally skew-symmetric with 


b | nd 7 EM | : " (5.59) 


and observe that «eg, = 6%,. One defines c? and e a Similarly. 
Note that cag is invariant under SL(2, C), as 


Cog — CB = e," No" Ng" = det(N)eog = Eag (5.60) 


or in matrix notation NeNT = e. 
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If we define the contravariant representation Doy on V via 
Dov(N) > (NT) t4 (5.61) 
and the complex conjugate contravariant representation by 
Div(N)X > (NT) Tx (5.62) 


then NeNT = e implies that (NT)-! = e^! Ne, so Doy is equivalent to the fundamental 
representation. The complex conjugate representations are similarly equivalent. 

The tensors e45 and €^? are called invariant tensors as they transform into themselves 
under the action of SL(2, C). For SO(3, 1), the invariant tensors are rj, and r"", which 
can be used to raise and lower indices. We will raise and lower SL(2,C) indices using ^^, 
Eg, SO if Wa, Xa are in the fundamental and conjugate representations respectively, then 
we define 

pr= rg, xe m Shy, (5.63) 

One can construct a tensor product representation of the fundamental representation 
acting on n products of V, V V &V---&V. In terms of components, elements of the 
tensor product vector space have components 1/5,...4,, which transform under the action of 
SL(2, C) as 

Vo... S T MD = Na ee No, Pig, ..-Bm (5.64) 


for N € SL(2,C). Similarly, tensor product representations of the complex conjugate 
representation correspond to complex tensors with components X4,..9; Which transform 
as 
c v LON*. Aa x. mo. . 
By taking the tensor product n tensor products of V acted on by the fundamental 
representations, with m tensor products of V acted on by the conjugate representation, one 


obtains a vector space which has elements with components w > „ Which transform 


0... [1 ... d 
as 


Voss as I: V = Noa” O Na N gn ED: N^ aie). Pe (5.66) 


01... Qm 1 -.-Bm 


This representation is in general not irreducible. 


5.4.1 Lorentz and SL(2,C) indices 


It is straightforward to map between Lorentz invariant tensors and SL(2,C) invariant 
tensors. In particular, recall that the relationship between N € SL(2,C) and the corre- 
sponding Lorentz transformation A = A(N) is given by 


Ngo" Ni = n, AP a" a" (5.67) 
which implies that 


No" Ni = g,, AP p" o" = (nt At nto” = (A Do" (5.68) 
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So, denoting the components of o” by c" j| one finds 


0.5 — Na* NT A" oos (5.69) 


a 


which implies that c" P is invariant. One can also define 


2Hjóa _ ag àb H 
(c^) ere a d (5.70) 


0 


so that ° = a9. g* = —o' for i = 1,2,3. 


Exercise: Prove the following useful identities 
i) ota” + o" a^ = mI 
ii) Tr og” = 2n 
iii) of 09 = 25058. 


Definition 46. Define the 4 x 4 matrices 4" by 
0 c" 
H 5.71 
y | ap D (5.71) 


UY") S Y Y tt = onl (5.72) 


An algebra satisfying this property is called a Clifford algebra. 


Then these matrices satisfy 


Definition 47. A Dirac spinor V p is a 4-component spinor constructed from left and right 


Vp | 1 (5.73) 
X 


The gamma matrices act on Dirac spinors. 


handed Weyl spinors Ya, X* via 


The o-matrix identities are useful. For example; recall that the correspondence be- 
tween A € SO(3,1)! and N € SL(2,C) is given by No"N! = n pA’ n" a". Then using 
(ii) above the components of A are given by 


A“, = jn (a^ No, NÝ) (5.74) 


Also, it is straightforward to relate tensors with SL(2,C) indices to tensors with 
Lorentz indices. Given a 4-vector with Lorentz indices V^ one can define a tensor with 
SL(2, C) indices via 

Vas = V"(a i)aà (5.75) 
The invariance of (o”)aġ ensures that if V” transforms as V^ — A", V" under the action 
of the Lorentz group, then Vag transforms as Vas — NÊN ae Vig under SL(2,C). This 
expression can be inverted using (ii) of the above exercise to give 
1 
yes 3 (0 Y^ Vas (5.76) 
Similar maps between higher order tensors can also be constructed. 
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5.4.2 The Lie algebra of SL(2, C) 


The Lie algebra of S L(2, C) consists of traceless complex 2 x 2 matrices; which has six real 
dimensions. This is to be expected, as £(SO(3, 1)) is six-dimensional. It is convenient to 
define the matrices 


(g^) B (ota — gg P 
(aM) = g — aa") à (5.77) 
so that c?! = —io!, otk = je Mot. a= tot, gik — legt, It is clear that the c"" span 


the 2 x 2 traceless matrices over R (as do the a”), hence they are generators of the Lie 
algebra of S L(2, C). By a direct computation we obtain the commutation relations: 


[oh o°] = j (n ^a^ + ggg"? — oP — gg") (5.78) 


which is the same commutation relation as for the Lie algebra £(SO(3, 1)). Similarly, we 
find 
lah c? = (n ^a" + ng = ane = nea?) (5.79) 


Hence the o/” and g”” correspond to representations of L(S0(3,1)). 
The action of SL(2,C) on left-handed Weyl spinors is given by 


Ya — (er) Pis (5.80) 


Just as for the Lorentz algebra, one can define J; = tiko; = 30! and Kj; = 09; = tot. 
Hence A; = 50", B; = 0. Therefore the fundamental representation corresponds to a spin-3 
EL (SU (2)) representation generated by A, and a £(SU(2)) B-singlet. This representation 
is denoted by (3,0). 
The action of SL(2, C) on right-handed Weyl spinors is given by 

Xa ae (chio (5.81) 
Again, define J; — ie)"; = io! and Ki = do; = —io. Hence A; = 0, Bi = 50". 
Therefore this representation corresponds to a spin-4 £(SU(2)) representation generated 
by B, and a £(SU(2)) A-singlet. This representation is denoted by (0, 5). 


5.5 The Poincaré Group 


'The Poincaré group consists of Lorentz transformations combined with translations; which 
act on the spacetime co-ordinates by 


ch — Apr” + b" (5.82) 


where A is a Lorentz transformation and b € R^ is an arbitrary 4-vector. One can denote 
the generic Poincaré group element by a pair (A, b) which act in this way. Note that under 
the action of (A,b) followed by (A’,b’); x —^ M Ax + A'b + 0’, hence one defines the group 
product to be 

(A’, b) (A, b) = (A'A, A'b + 0’) (5.83) 
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so the Poincareé group is closed under this multiplication. The identity is (14,0) and the 
inverse of (A, b) is (A71, —A-1b). 
One can construct a group isomorphism between the Poincaré group and the subgroup 
of GL(5, R) of matrices of the form 
A b 
5.84 
(2j " 


where A is a Lorentz transformation and b is an arbitrary 4-vector, as under matrix mul- 


/ / / / / 
| A 1) | A ) -( A'A i5 tus 
0 1 0 1 0 1 


5.5.1 The Poincaré Algebra 


tiplication 


Consider a curve 
| A(t) 28 (5.86) 


in the Poincaré group passing through the identity when t = 0, so A(0) = I, b(0) = 0. 
Differentiating with respect to t and setting t = 0 we note that the generic element of the 


(tj) bn 


where m € £(SO(3,1)) and v € R^ is unconstrained. Hence a basis for the Lie algebra is 
given by the 5 x 5 matrices M"" and P" for u,v = 0,1,2,3 where 


Poincaré Lie algebra is of the form 


(MP), = ifa, — sot) 


(M — (MP a= CM, 0 (5.88) 
and 
(P^)', = int” 
(sep ese (5.89) 


(labeling the matrix indices by u,v = 0,1,2,3 and the additional index is ^4"). The M^? 
generate the Lorentz sub-algebra 
m 0 
5.90 
( 0 jJ (5.90) 


for m € £(SO(3, 1)); they satisfy the usual Lorentz algebra commutation relations 


[M , MP?) = i(M"" n"? + MPR — MHP nv? — MY nt) (5.91) 
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The P" generate the translations 


0 iv 
CD T 


[P^, P"] 20 (5.93) 


for v € R*. The P" satisfy 


and 
P", M??] = in?" p? — ij?" p? 5.94 
7] 7] 


The commutation relations 


[Me Mer] = i( Mg? + MPRI — MP — Meg?) 
[P Mr?) = in” P? — in?” PP 
[P,P] = 0 (5.95) 


define the Poincaré algebra. 


5.5.2 Representations of the Poincaré Algebra 


Definition 48. Suppose that d is a representation of the Poincaré algebra. Let ej,55 be 
the totally antisymmetric tensor with €9193 = 1. Then the Pauli- Lubanski vector is defined 
by 


1 O V 
W, = z Erpovd( M? )d(P”) (5.96) 
Proposition 31. The Pauli-Lubanski vector satisfies the following commutation relations: 
1) [W,, d(P,)| = 0 
2) [Wi d(Mpc)| = inugWs — Nyo Wp 
3) [W,, Wr] = —icpvpo W?d(P") 
Proof 
We will use the identities 
Epapy e! = —65" 195” 587. = —66l^,6" 5571. (5.97) 
and 
Emap rT = —46° a0 g = —45 571, (5.98) 
To prove (1) is straightforward: 


[Wn d(P,)] = Seuooe M?" (P^), d(P,)] 
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puna (dM? CPP), d P,)] + [d(M??), dCP,))d (P^) 


= twee dM" (P^, P,]) + apu", Pap?) 
= 5000 — ió^,d(P^) + 15°, d(P?))d(P®) 
EH (5.99) 


To prove (ii) is an unpleasant exercise in algebra: 


[Wy d(Mgc)] = zeuxxeld(M^X)d(P^), d( Mpa) 

cuaxo (d(M**) [a(P), d(Mpa)] + [d(M**), d(Mpa)|d(P*)) 
exo (d(M**) q([P*, Mpo]) + d(LM^*, Mpo])d(P*)) 

(d(M*) (16° ,d(P,) — io? ,d(P,)) 

+ i(d(M*g)%p_ — d(MX,)9^, — d(M^,)8*, + d(M*,)9^, )d(P)) 


a 
= gf (d(M**)5°,d( Ps) — d(M**)65°,d(P,) 


+ 24(M^,)8X d( P^) — 2d(M*,)5*od(P*)) 
i 
z 5 Mord p = Nor 5" a) eux (d(M**)d(P’) = 2d(M"")q(PX)) 
3i 
= 3 09, c Npr Oa) Euro (M PX)d( P) (5.100) 
But 
1 
xT WI = 3€ vans d(M'"2)d(P"3) 
= 3d(M*)d(P7!) (5.101) 
Hence 


a T 
[Wi d(Mpc)| = z (Mord p z Tloró a) Eurxoe™ QW? 
a T: T 
= 5 (orð p — npe ^u) (72) (9" unies, — 5” ani )W? 
= inupWo — inu Wp (5.102) 


as required. 
(3) follows straightforwardly from (2): 


Wi, Wo] = Fevpoo [Wu d MP?) d( P*) 


5 vooo (Wi. AMP? )]a(P%) + dM?) |W, dP?) 
see. d(M^")Jd(P^) 
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= sept i^ W? — i$" ,WP)q(P^) 
= —icuvpo WPd(P") (5.103) 


as required. W. 
From this we find the 


Corollory 3. The following commutation relations hold 
1) [W W", d(P,)] — 0 
2) (WW, d(Mpo)] = 0 


Proof 
(1) follows because [W W", d(P,)] = W,[W",d(P)] + [IW,, d(P,)|W" = 0 
(2) holds because 


[Wu W", d(Mpo)| = W,[W", d(Mpo)| + [Wy d(Mpo)|W" 
= W,,(i5" Wo — i0" ,W,) + (itupWo — inyo Wp)W” 
-0 (5.104) 


as required. M 
Hence we have shown that W,,W" is a Casimir operator. d(P,,)d(.P") is another Casimir 
operator: 


Proposition 32. The following commutation relations hold 
1) |d(Py)d(P*), d(P,)| = 0 
2) [d(Py)d(P*), d(Mpo)] =0 


Proof 
(1) follows because 


[d(P,)d(P^), d(P,)] = d(Pu)ld(P*), d(P.)] + [d( Pu), (P)]d(P") = 0 (5.105) 


(2) holds because 


d(P,,)[d(P"), d(Mpo)| + [d( Pu), d(Mps)] d (P^) 
= d(Py)(id"pd( Po) — ið" ,d(P)) + (inupd(Po) — inuod(Pp))d(P") 
0 (5.106) 


as required. M 
We shall show that irreducible representations are classified by the values of the two 
Casimir operators W,,W" and d(P,,)d( P"). 
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In particular, suppose that D is a unitary representation of the Poincaré group acting 
on V. Such representations arise naturally in the context of quantum field theory when 
V is taken to be a Hilbert space, and it is assumed that Poincaré transformations do not 
affect transition probabilities. We will assume that this is the case. 

Note that iM,, and iP" form a basis for the (real) Poincaré algebra. Hence one can 
locally write the Poincaré transformation as 


e7 $ bn P" reu, MP") (5.107) 


for real b, and skew-symmetric real w,,, and 


D(e7 En Prou MH) —. o Oud (P) eos d (Mn) (5.108) 


where d is a representation of the Poincaré algebra acting on V. As D is unitary, d( Mpo) 
and d(P") are hermitian. 

As the d(P") commute with each other and are hermitian, they can be simultaneously 
diagonalized, with real eigenvalues. For a 4-vector q” define the subspace V, of V to be 
the simultaneous eigenspace 


V; = (v) € V : (P^) |b) = d" |v) , w= 0,1,2,3} (5.109) 


and 


v=Q\V, (5.110) 


Then on Vy, d(P;)d(P") = ¢“q = q?. We will assume that for configurations of 
physical interest, such as when q is the 4-momentum of a massive particle or of a photon, 
that q? > 0 and q? > 0. We will only consider these cases. 

Consider first the operators 


h^(t) = e Ond PH) twr d( MM) q( pr) em Z Gud PH) ted MM) (5.111) 
Differentiating with respect to t we find 


à ; 
an — er (ud( P^), d(M"")) [d(P>) 


dt 5 (bud (P^) + ou MH) e EAP Hera oer) 


wò e Ond P rd 4) qp Px) e~ $ (ud (PH) ad Qro) 


= wò AX (5.112) 


with the initial condition A^(0) = d(P*). Therefore 
h(t) = (e'^)^ ,d(P?) = (e 2» M""^ (PP) (5.113) 
Hence, setting t = 1 we find 


d( P^)e- 3d? ) rou d (MP?) = (e à MM) e7 bu APH) +e vd MM) qt pe) (5.114) 


So, if |p) € V, then e-30«4P^) reus d("?) p) E Vy where g! = e 3 M", 
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Definition 49. The stability subgroup H4 (or "little group”) which is associated with V, 
is the subgroup of the Poincaré group defined by 


Hy = {672 On P" tou MEY) 5 o= 3 0ud( P^) red ( M") ly & V. for |b) € V;) (5.115) 


It can be shown that H, is a Lie subgroup of the Poincaré group. Suppose then that 
E it v 
— (by P" + wM”) € £(H,). It follows that e^ 2%. M" g = q for t € R. Expanding out 
in powers of t we see that this constraint is equivalent to 


Gp =0 (5.116) 


which has a general solution 
Wuv = Epvpo n? Q (5.117) 


where n? is an arbitrary constant 4-vector. Hence, if |v) € V} and e 3 Ou P^ rou, MH”) e Hy 
then 
eT E bud PH) ou dM") [py — e 52nd" pint Wa Jap) (5.118) 


so we have reduced the action of H, on V, to the action of W,, on Vg. 
The action of a generic Poincaré transformation (A,b) does not leave V, invariant, 
because 
Vy > Vj = Vag (5.119) 


as q' = Aq. However, q? = q" is invariant. Hence we can split V into invariant subspaces Va 
corresponding to vectors q^ which have the same value of q?. We will therefore henceforth 
work with such an invariant subspace, and consider q? = m? to be fixed. 

If m? > 0 then there is a Lorentz transformation A'(q) such that q = A'(q)k where 
k" — (m,0,0,0). Alternatively, if m — 0 then there is a Lorentz transformation A(q) such 
that q = A'(q)k where k” = (E, E,0,0). These Lorentz transformations can be taken to 
be fixed functions of the q. 

The key step is to show that the action of the entire Poincaré group on the V, (with 
q? = m? fixed) is fixed by the action of Hj on V; (which is in turn determined by the 
action of the Pauli-Lubanski vector on V). 

To show this, first note that if |y) € Vk, then one can write 


lha) = D(A" (a), 0) |vx.) (5.120) 


where |w,) € Vz, and these transformations can be used to obtain all elements of V, from 
those in Vj. 

It is then straightforward to show that the action of the representation of the whole 
Poincaré group on (V, : q? = m?) is determined by the action of D(H;,) acting on Vp. 

To see this explicitly, suppose that |Y m) for M =1,..., lp is a basis of Vp. Then one 
can define 


Ig.) = D(A (a), 0) lk, m) (5.121) 
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and the |q, m) then form a basis for V}. The representation of Hy, on Vp is determined by 
the coefficients D(h) wn, where h € Hy, in the expansion 


h) |i ur) = 2n )u |n, N) (5.122) 


Suppose that (A, b) is a generic Poincaré transformation; then one can write 


D(A, b) |o. = D(A, b)D(A’(q), 0) |i, m) 
D(A'(Aq),0)D(A' (Aq) AA (q), A(Aq) !b)|Ux,u) (5.123) 


However, ((A (Aq))- 1 AM (q), A (Aq)-!6) € Hy, so it is possible to expand 


D(A, b) |a, m) E (A (Aq), 0)D(A' (Aq) AA (q), A'(Aq) Ib) lx.) 


= Y (A'(Aa)- 1 AA (a), A (Aq) b)urw long) (5.124) 


We will examine the action of Hj on Vj; in the timelike and null cases separately. 
Although 


v 204v (5.125) 


is not in general finite-dimensional, we shall assume that V; (and hence the V;) are 
finite dimensional. 


5.5.3 Massive Representations of the Poincaré Group: k” = (m,0,0,0) 
We compute the action of W,, on Vp. If |W) € Vi then 
1 " 
Wo lw) = zeied(M")a( P^) |) = 0 
1 
Wi |b) = —;eijed( M?" )d(P^) |i) = —md(J;) |) (5.126) 


We have already shown that d(J;) generates a £(.SU(2)) algebra, hence the little group for 
massive representations is $O(3). For irreducible representations, the spin is fixed by the 
value taken by the Casimir on Vi; W,W" |y) = —m?4d(J;)d(J;) |v). 

5.5.4 Massless Representations of the Poincaré Group: k” = (E, E,0,0) 

Again, we compute the action of W,, on Vp. If |v) € V; then 


Wo lé) = zeged( Ma P^) lj) = d(M?a(P!) |i) = Edl) |o) 
Wi |) = —Ed(M*9) lj) = -Edl h) lb) 
Wa |b) = zeaad(M"")d(P^) |) 
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(d(M'*) — (M?) |y) 
E(C-d(42) + d(K3)) |) 
Ws |Y) = zesuvAd(M""d(P^) |) 
= E(-d(M?) + a(M™)) jy) 
= E(—d(J3) — d(K3)) |v) (5.127) 


Observe that the following commutation relations hold: 


(—d(J2) + d(K3), —d(J3) — d(K2)] = 0 
[d( Ji), —d(J3) — d(K2)] = —i(—d(J2) + d(K3)) 
[d(J1), —d(J2) + d(K3)] = i(—d(J3) — d(&2)) (5.128) 


These expressions may be simplified slightly by setting Rı = —d(J3) — d( K2), Ro = 
—d(J2) + d( K3), J = d(J1); so that 


[Ri R] 20, [J,Ri]=—-iRo, [J, Ro] = iRı (5.129) 


Rı and Rə are commuting hermitian operators on Vi, and hence can be simultaneously 
diagonalized over R. Consider a state |v) € Vp with Ri |w) = ri |W), Re |v) = ra |v) for 
ri,T9 € IR. Define 


f(8) e ^ Re ih 
g(0) = e Roe y) (5.130) 


for 0 c R. Differentiating with respect to 0 and using the commutation relations we 
find T = —g, ag = f. Solving these equations with the initial condition f(0) = rı |v), 
g(0) = ra |Y) we find 


f(8) = (r1cos0 — rosin0) |Y) 
g(0) = (ri sin 0 + ra cos 0) |Y) (5.131) 


which implies 


Rie! |) = (r1 cos 0 — ra sin 0)e/9/ y) 
Rye’? |) = (r sin 0 + r2 cos 0)e/97 |y) (5.132) 


Hence, unless rı = rg = 0, there is a continuum of R4, R5 eigenstates which implies Vg 
cannot be finite-dimensional. We must therefore have Hj; = Ro = 0 on Vy. J is also a 
hermitian operator on V;, and can also be diagonalized. For irreducible representations, J 
can have only one eigenvalue, o € R. c is called the helicity of the particle. It follows that 
W, = ck,, so it is clear that o is a Lorentz invariant quantity. 

There is no algebraic constraint fixing the value of the helicity ø in the massless case, 
as there is to fix the spin in the massive case. However, for physically realistic systems, one 
can make a topological argument to fix 2e € Z. This is because e'?7 describes a rotation of 
angle 0 in the spatial plane in the 2,3 directions. So in particular, setting 0 = 27 we find 


ei |) = ert yy) (5.133) 
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We require that e?” = +1 (for a projective representation) and so 2e € Z. Neutrinos 


L1 
C3) 


have helicity +5, photons have helicity +1 and gravitons have helicity +2. 
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6. Gauge Theories 


Lie groups and Lie algebras play an important role in the various dynamical theories which 
govern the behaviour of particles - the gauge theories. Though we will not examine the 
quantization of these theories, we shall present the relationship between Lie algebras and 
gauge theories. 

Before examining non-Abelian gauge theories, we briefly recap some properties of the 
simplest gauge theory, which is the U(1) gauge theory of electromagnetism. 


6.1 Electromagnetism 


'The gauge theory of electromagnetism contains a field strength 
fuv = pav = Ova (6.1) 


where u,v = 0,1,2,3 and x” are co-ordinates on Minkowski space (indices raised /lowered 
with the Minkowski metric 7), and 0, = Ze. a, € R is the 4-vector potential. 


a 


Under a gauge transformation aj, — aj, 


a, — O,A where A is a real function, 
n E fiy = On (ay Pe Oy) z Ov (ay, zz On) = nay "2d Orap = fuv (6.2) 


since 0,0,A = pð A. Hence fy is invariant under gauge transformations. 
The field equations of electromagnetism are 


Dos = jv (6.3) 


and 
go dor (6.4) 


Equation (6.4) holds automatically due to the existence of the vector potential. Conversely, 
if fav satisfies (6.4) then it can be shown that a vector potential a, exists (though only 
locally) such that fj, = 0,a, — Orap- 

Using the vector potential, one defines a covariant derivative D, by 


Dub = Ob + tay (6.5) 

where v» = w(x). Under a gauge transformation 
yp — yp = ey, ay, — a, = Qp — pÀ (6.6) 

where A = A(x), it is straightforward to see that 


D, — (Duby = dle) + ilap — ALAY 
eD, (6.7) 


so that D, transforms like v. This means that the Dirac equation 


iy D,Y — mV = 0 (6.8) 
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is gauge invariant. V is a 4-component Dirac spinor constructed from left and right handed 


v= ( 2 (6.9) 
X 


and the 4 x 4 matrices ?/" are given by 


0 oF 
y= | zh s) (6.10) 


These matrices satisfy the Clifford algebra 


Weyl spinors v4, X via 


ys E qq = ans (6.11) 


The standard Lagrangian governing the interaction of electrodynamics with scalar 
fields 


L= - fuf" + 5(Dyd)" D'ó- VGO) (6.12) 


where $ is a complex scalar field, and V is a real function of $*ó is also gauge covariant. 
It is possible to obtain the gauge field strength in a natural way from the commutator 
of covariant derivatives. If $ is a scalar field then 


D,D, EX (On m iau) (Ov + ia, ) 
= 0,0,0 + ilað h + a,0,9) — aparo + ið avo (6.13) 


and hence 

[Dus D,]ó = i(ð av m vap) =ifwo (6.14) 
6.2 Non-Abelian Gauge Theory 
6.2.1 The Fundamental Covariant Derivative 


Suppose that G is a compact matrix Lie group acting on a vector space V via the funda- 
mental representation. Consider a scalar field ®(x) which is an x dependent element of 
V (which can be thought of as a column vector of ordinary scalar fields). Suppose that ® 
transforms under the fundamental representation as 


(x) — $'(x) = g(x) P(x) (6.15) 
where g(x) € G. 
Definition 50. The fundamental covariant derivative D,, is defined by 
D,9 = 0,9 + Ay, ® (6.16) 


where A, € L(G) is an element of the Lie algebra of G acting on V. 
We require that D,,® should transform under local gauge transformations in the same 
way as 9. Suppose that A, — Ai, under local gauge transformations. Then we need 


0,9 + Al ®! = g(0,0 + A,9) (6.17) 
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which implies 
O,(g9) + A gÈ = g(0,9 + ALÈ) (6.18) 


and hence 
0,99 + A,gó = gA,9 (6.19) 


As this must hold for all ®, we find the transformation rule 


A, = gAug | —Ougg | (6.20) 


Before proceeding further, there is a question of consistency: namely if A, € L(G) then 
we must verify that Aj, given above is also an element of L(G). This is proved using the 


Lemma 11. /f g(t) is a curve in the matrix Lie group G then 49 (t)! € L(G). 
Proof 

Suppose that g(t) = go when t = to. Set h(t) = g(t + to)gg |. Then A(t) is a smooth 
curve in G with h(0) = I, and 


dg = dg . 
=|= = d; =t i (8 H li=to (6.21) 


But dh | io € L(G) by definition, and hence (42971) [tg € L(G) for all to. 8 
Hence we have shown that O,gg | € L(G), and from our previous analysis of the 
adjoint representation, we know that gA,g ! € L(G); so A’, € L(G) as required. 


6.2.2 Generic Covariant Derivative 


Definition 51. Suppose that G is a matrix Lie group with representation D acting on V, 
and let d denote the associated representation of the Lie algebra acting on V. Let elements 
0 € V transform as 0 > 0' = D(g(x))0 under local gauge transformations. 

Then the covariant derivative D,, associated with D acting on V is defined by 


D,0 = 0,0 + d(Ay)0 (6.22) 


where A, € L(G) transforms as A, > Aj, = gAug |— O,gg |. 
In order to show that this covariant derivative transforms as D,,0 — D(g)D,0 we must 


prove the 


Lemma 12. Suppose that D is a representation of G acting on V, with associated repre- 
sentation d of C(G) acting on V. Then 


i) If v € L(G) and g € G, then d(gvg- |) D(g) = D(g)d(v) 
ii) If g(t) is a curve in G then dP) p(g-!) = d(S8g-!) 


Proof 
(Caveat: in this proof t is simply a parameter along a curve in G, not the spacetime 
co-ordinate x°’) 
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To prove (i), set g = e^ for h € L(G), so that D(g) = e"), Then (i) is equivalent to 


eh q(ehyg 7h) ed) = d(v) (6.23) 
Set 
F(t) = e 00 deve) gtd() (6.24) 
for t € R. Then 
a = ~ta) ((d(e"’ve), a(h)] +d eth vJe t^) et) 
— e&-td(h) (d([c^ ve^. h]) 4 d(e'^[h, ve t^) etat 
—td(h) (dle v hje ^) m d(e'^[h, ve t^) etat) 
z (6.25) 


and f(0) = d(v). Hence, f(1) = e-"U9q(e^ue-^)ed) = f(0) = d(v) as required. 
To prove (ii), suppose g(t) = go at t = to. Set h(t) = g(t + to)gg ^, so that h(t) is a 
smooth curve in G with h(0) = I. Then 


dt D(g7")\t=to = bo Pl") = —d = (6.26) 
on d dh(t) 
g 
a Fo hin) = a" io) (6.27) 


As h(0) = I we can set h(t) = e'/1*9() for some constant matrix hı. Then 


dh(t) 


di [zo = hı € L(G) (6.28) 
aO y = al) (6.29) 

But by definition : " " 
d(hi) — qe Dess q P0) lio (6.30) 
'Therefore dD(g),, , "E s - 
de (g let = (hi) = e tto) (6.31) 


are required. M 


Proposition 33. The covariant derivative D,, associated with D transforms as 


D,0 > (D40) = D(g) D,0 (6.32) 
under local gauge transformations. 
Proof 
Note that 


8,0 + d(Al,)6" = O,(D(g)0) + d(gAug | —0,gg-  )D(g)0 
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= D(g)(8,0 + d(A,)0) + (d(gAug ')D(g) — D(g)d(A,) 

+ 9, (D(g)) — dð gg !)D oN 

= D(g)D,0 + (d(gAug !)D(g) — D(g)d(A,))0 

+ (8,(D(g)) — d(8,99^ 5 (6.33) 


However, by the previous lemma, we have proved that d(gA,g !)D(g)—D(g)d(A,) = 0 
and 0,,(D(g)) — d(0,gg !)D(g) = 0, hence 


9,0' + dA)” = D(g)D,0 (6.34) 


as required. M 
Given this property of transformations of covariant derivatives, one can define the 
adjoint covariant derivative 


Definition 52. Suppose that 0 € C(G) transforms under the adjoint representation Ad of 
G. The the covariant derivative associated with the adjoint representation is 


D,6 = 0,0 + (ad Aj) = 8,0 + [A,, 6] (6.35) 


To summarize, we have shown that if ® transforms under the action of the fundamental 
representation as ® — 4' = gó, then in order for the fundamental covariant derivative to 
transform in the same way, one must impose the transformation 


Ay — A, = gAug | — 049g! (6.36) 


on the gauge potential. We then have shown that if ® transforms under the action of a 
generic representation D, ® — ©’ = D(g)®, then the same transformation rule A, —^ 
Al, = gAug | —0,gg ^! 
9,9 + d(A,,)® also transforms in the same way as ®. Caveat: a covariant derivative is 


is sufficient to ensure that the generic covariant derivative D,,® = 


always defined with respect to a particular representation 


6.3 Non-Abelian Yang-Mills Fields 


Following from the relationship of the U(1) electromagnetic field strength with the com- 
mutator of the U(1) covariant derivatives acting on scalars, we consider the commutator 
of the fundamental covariant derivative D, acting on (x) € V, which transforms under 
the fundamental representation as ® — 4' = gó: 


[D,, Di]® = (8, + Ap) (0/9 + A,9) — (8, + AJ) (0,9 + Ap) 
= (ð Av — OA, + [Ay, A,]) ® (6.37) 


Definition 53. The non-abelian Yang-Mills field strength is 


Fav = ð, Ay — 3p Ap + Ag A] (6.38) 
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Note that as A, € L(g) it follows that Fay € L(G). Note also that by construction 
[D D] transforms like ® under a gauge transformation. Hence if F}, is the transformed 
gauge field strength, then F/,,®’ = gF,,,®. As this must hold for all ®, we find 


Fiy = gFug * (6.39) 


so that F transforms like the homogeneous part of Ap. 
Exercise: Verify this transformation rule for F,, directly from the definition F,, = 
ô Ay — Oy A, + [An Ay] together with the transformation rule of A,,. 


Lemma 13. The non-abelian field strength satisfies 
D,DEp =0 (6.40) 


where here D, is the adjoint covariant derivative. 
Proof 


D,D,F"" = D,(8,F'" + [A,, F""]) 
= 0,0,F"" + [A,, 0, F""] + O,[Ap, F""] + [An [A,, F] 
= [A,,, DF" + [3 An, F""] + [Aj, 9, F""] 


1 
= [Ay DFM) + lEn — [Ap Av], FY] + [Av, D,F"" — [A,, F""]] 


2 
= [A,, D,F""] + [A,, D,F""] — EAn Av], E] — |A,, [A n, F""]] 
1 V V 
= -3lAu: Av ghk ] [Aus [An F” ]] 
=) ( using the Jacobi identity) (6.41) 


as required. M 
6.3.1 The Yang-Mills Action 


Definition 54. The non-abelian Yang-Mills Lagrangian is 


1 V 
= ga" Fm» F” ) (6.42) 


where « is the Killing form of the compact matrix Lie group G. 


Proposition 34. The non-Abelian Yang-Mills Lagrangian is gauge invariant 
Proof 


Under a gauge transformation 
(Fy, FH”) — s(gFuug |, gF" g) (6.43) 


Suppose that X,Y, Z € L(G). Then for t € R, compute 


d 
qe Xe, eye) E &(—et [X, VA RR py 4 &(e Xe ^, —e' ly, Ble.) 
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k(—|e Ne Z], Ve to wd xe to le" Ve. t^ 7] 
= one xe [Z et^ye 7) ele xe le Ve te Z] 
=0 (6.44) 


where we have used the associativity of the Killing form. Therefore 


klef Xe^7, e? Ye-7) = «(X,Y) (6.45) 


and hence it follows that 
K(gF wg’; gF""g 1) =K aE) (6.46) 


as required. M 

Note that the coupling constant e plays an important role in the dynamics. If one 
attempts to rescale A so that A, = eA, it is possible to eliminate the explicit factor of e 
from the Yang-Mills Lagrangian, and write 


at (Fw, P") = (By, Pr") (6.47) 
where here Ê nes 0,À, —0, À, --e[À,, Â]. Although the explicit e-dependence of the Yang- 
Mills Lagrangian appears to have been removed, observe that the gauge field strength now 
has an e-dependent term, which arises from the commutator which is quadratic in A. So the 
dependence on e in the non-abelian theory cannot be removed by rescaling. If, nevertheless, 
one performs this rescaling (and then drops the ^on all terms), then the generic covariant 
derivative is modified via D,,® = 9, + ed(A,,)®, and the gauge potential transformation 


1 


rule is also modified: A, — a, = gAug |—e lü,gg |. Whether e appears as an overall 


factor in the Yang-Mills Lagrangian, or within the covariant derivative and gauge field 


strength, depends on convention. Until stated otherwise, we shall however retain the a 


e2 
outside the Lagrangian, and work with the un-rescaled gauge fields. 
If the representation D of G is unitary, then one can couple addtional gauge invariant 
scalar terms to the Yang-Mills Lagrangian: 


L F, 


1 
uv, FH”) + 5 (D,9)  D^o — V(®'9) (6.48) 


= qa" 


where ® — D(g)® under a gauge transformation. 


6.3.2 The Yang-Mills Equations 


Consider a first order variation to the gauge potential A, — A,+dA,. Under this variation 

Fay > Fu + (059A, + [Ap 6Av]) — (Ovd Ap + [Av, 9A,]) (6.49) 
Hence, the first order variation of the Yang-Mills action $ = E f K( Fup, F"")d'z is given 
by 


n 1 
| 92e 


ôS T K(OF yy, FH” )d x 
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1 
e2 


RE &(6A,, 0, F"" + [A,, F""])d'z + surface terms (6.50) 
2 H u 


I K(0,6A, + [An 6A,], F” )d x 


where we have made use of the associativity of the Killing form. Neglecting the surface 
terms (assuming that the solutions are sufficiently smooth and fall off sufficiently fast at 
infinity), and requiring that 6S = 0 for all variations 6A, we obtain the Yang-Mills field 
equations 
DEC Ay P= 0 (6.51) 
or equivalently 
DBS) (6.52) 


where here D,, is the adjoint covariant derivative. 


6.3.3 The Bianchi Identity 


There is a non-abelian generalization of the identity e. fup = 0 in electromagnetism. 
Let D, be the adjoint covariant derivative. Consider 


DyuFup = O,(0, Ap = 05A, A [AL, A,]) 3b [Aa 0, A, 05A, J5 [A,. A,]| 
= 0,,(O,Ap — 05A) — ([An, 05A] + [Ap, 0,A,]) 
=i [An 0, Ap] E [Av, Oy Ap] 3E [Au, [AL, Ap] (6.53) 


Consider the contraction €"? D, F,,. As the terms ([A4, 05A,] + [A5, 0. Av]) are sym- 
metric in p, p and the terms [A,, 0, Ap] + | Aj, 0, Aj] are symmetric in u, v, these terms give 
no contribution to €"? D), Fy». Also, 


eA. [AL, Ap] = 0 (6.54) 
from the Jacobi identity, and just as for electromagnetism, 
Pa OA OA (6.55) 


because the partial derivatives commute with each other. Hence we find eò!” °DFyp = 0, 
or equivalently 
Diya + Dobra Pc (6.56) 


This equation is called the Bianchi identity. 

This can be used to prove a Jacobi identity for all covariant derivatives. Suppose that 
D,, is the covariant derivative associated with the representation D, and ® € V transforms 
under the representation D as ® — ®’ = D(g)®. Let d be the induced representation of 
L(G). 

Then note that 


D,Dy® = D,(O + d(Ay)®) 
— 0, (019 + d(Ay)S) + d(A,)O,® + dLA,)d( A) 
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= 0,049 + d( Ax)0,9 + d(A,)O,® + (d(O, Ax) + d(A,)d(Ay))® — (6.57) 
and hence 


[D,, Dj] 9 = d(F,,)® (6.58) 
'Then 


[D,, [Dv, Dal]® = Du(d(FLy)®) — d(F;j3) D, 
= ô (d(FLy)®) + d(Ay)d( Fy) ® — d(FiX)(0,9 + d(Ay)®) 
= (Qud( Fy) F [d(A,), d(F,)])® 
= d(Di4 F,)® (6.59) 


where pie denotes the adjoint covariant derivative. 
Hence 


(in. [D,, Dal] RE [Dj [D), Dll + [D), [D,.Dill) ® E d( D2 E, + DIG Fy, + DS? ue 
=0 (6.60) 


using the Bianchi identity on F. As this must hold for all ® we obtain the Jacobi 
identity for covariant derivatives: 


[Dus [Dv, Dal] + [Dv [DA Dull + (Dx; [Du Del] = 0 (6.61) 


6.4 Yang-Mills Energy-Momentum 


The energy momentum tensor of the Yang-Mills field strength is 


1 
Teh ee) qk Fo, FP?) (6.62) 


This differs from the canonical energy-momentum tensor by a total derivative. Observe 
that Ty, is gauge invariant by construction. 


Proposition 35. If the Yang-Mills field strength F satisfies the Yang-Mills field equations, 
the energy-momentum tensor satisfies 


9"T,, =0 (6.63) 


Proof 


Ty = Os Fya, Pò) — GOW Foo, F^") 


1 
&(O^ Fay, F,^) + &( Fun, 0 F^) — 5 (0, Foo, FP?) 
= &(D"F, — [A", Fy], F^) + k(F,i, DY,” — [A", F>) 
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1 
— 9 CDs Foo SAL Foo], PT) 


(6.64) 


where here D,, is the adjoint covariant derivative. Note that by the Bianchi identity 


K(Fua, DEF >) = &(Fuy, -DF — D*F*,) 
K(Fya, Dp FP) — 6( Fy, D E^) 


and so 


1 
K( Fun, DAE,^) = 5h Fux, DvP) 
Hence 


MT yy = K(D" Fy, F^) — k([A", Fun), F^) — 6( Fun; [A", F7^]) 
1 
+ a (Av, Foo], F^") 


However, using the associativity of the Killing form 


F([Av, Foo], F) = r(A, [Foo F**]) = 0 
and 
K([A", Fyah F/^) + «(Fur [A", E/^]) = &(AP, Fu]; Fiò) 8E, 4], F^) 
=0 
hence 


OT.) = KD EA Fò) = 0 


from the Yang-Mills field equations, as required. BM 


6.5 The QCD Lagrangian 


(6.65) 


(6.66) 


(6.67) 


(6.68) 


(6.69) 


(6.70) 


The QCD Lagrangian consists of a non-abelian Yang-Mills gauge theory coupled to fermions. 


The gauge group is G = SU(3) colour. Take generators T; of G; as G is compact, these 


may be chosen so that the Killing form has components 
Kab = —Oab 


where a = 1,...,8 (SU(3) is an 8-dimensional Lie group). 
The Lagrangian is given by 


6 


(6.71) 


8 6 
1 : = 7 
Loop = —5 Y | FFM +i) DJa + eA (T) 49) Wy -Y mp VF Vay (6.72) 
a=1 f=1 


f 
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Here we have decomposed the non-abelian gluon field strength F and potential A into 


components 
Fup = Foyla, Ay = Aba (6.73) 
where 
Fo, = 0,45 — 0, At + e[Ay, An]? = 0,A% — 0, AC + ecc" AP, AS (6.74) 
The indices f = 1,...,6 are flavour indices; whereas the indices A,B = 1,2,3 are 


SU(3) colour indices. The SU(3)colour structure constants are ca^. my is the mass of the f- 
flavour quark. The SU(3) gauge coupling is e; observe that the gauge potentials have been 
re-scaled as mentioned previously in order to remove the gauge coupling factor from the 
Yang-Mills action; although this means that e now appears in the fundamental covariant 
derivative via ô, + eA,, and also in the gauge transformations via 


Ay > gAug | = e~! gg! (6.75) 


where g € G. 

The V 4r are fermionic fields associated to ue quarks- they are 4-component Dirac 
spinors. We set v4 = = 648 Wer, and vc (wet? 

Gauge ee 

The Yang-Mills term —1 eae Pipe $k(Fuv, F"") in the QCD action is auto- 
matically gauge invariant by construction. 


It remains to consider the terms involving fermions: the fermions transform under the 


fundamental representation of SU(3)cotour via 


V Ar > ga” (x) Vey (6.76) 


where g(x) € SU(3). It is then straightforward to see that the covariant derivative 
6470, + e(A,,) AP) Vp; then transforms as 
H m f 


(6479; + e(Ay) a?) Wer — ga“ (50? Oy + e(Ay)o?) Var (6.77) 


Also, the ve transform as 
V — Tors” (g*) p9 (6.78) 
Hence we see that under these gauge transformations 
V4, > Dgr (g*) pe gal Wo; 


= Vy, (6.79) 


where g € SU(3) implies ó4P(g*) p9g4€ = 62°. 
Similarly, under this transformation 


V4 (84P0, + (Apa? Wap — Voró^" (g*) p? gal (6070, + e(Ap)o?) Vay 
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= V4 (540, + e(Au) AP)Up; (6.80) 


It follows that all terms in the Lagrangian are gauge invariant. 
Equations of Motion 


There are two sets of equations of motion. Varying the gauge potentials A7 we obtain 
the variation of the action 


6 


6S = f dtr 6A? (0, F*"" + e[A,, FH]? + ie V ^ WI" (T,) AP py) (6.81) 
f=l 
from which we obtain 
6 
O,F""" + ea" ALEPH + ie S$” Uy" (Ta) P Way = 0 (6.82) 
f=l 
Defining 
8 6 
J” = —ie S X WE" (Ta) AP pg T, (6.83) 
a=1 f=1 
this can be rewritten as 
D,F"" = J” (6.84) 


Observe that D,,J" = 0 as a consequence of Lemma 14. 

There are also fermionic equations of motion; these may be obtained by varying ve 
and V pr in the action. These can be varied independently; from the variation of v (not 
varying V py) we obtain the equation: 


i" (5470, + eA% (T4) a”) Vp; T mW A; —0 (6.85) 


Next vary V p; (not varying V^)- then the action variation is 


óS — nz (iis ano, + eA% (Ts) a”) = yf oua; 
= / dx (- ið V dA" 54P + ie ey" AS (Ta) a? — má) óVp, (6.86) 
Hence we obtain 


—i0, Wi" 84P + ie" AT (T;) AP — mp UP =0 (6.87) 


Take the hermitian transpose of the above equation we obtain 


iy^ OUP — iey y ALT; a2 UF — m7 V? =0 ud 


where we have made use of the identity 4"! = y°7"7°. As Ta is anti-hermitian, we 
therefore recover (6.85). 
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